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Preface 


This book has been written for an introductory course in probability 
and statistics for students of engineering and the physical sciences. In the 
Second Edition, the authors have revised and updated the examples in the 
text, and set them apart from the explanatory material to make the book 
easier for students to use. All exercise material has been revised and addi- 
tional exercises have been added where needed. Applications to operations 
research have been woven into the chapters on distribution theory (Chap- 
ters 3 and 4) rather than being treated separately as in the First Edition. 
The treatment of Operating Characteristic Curves has been expanded 
(Chapter 7) and a section on Analysis of Covariance has been added 
(Section 12.6). Throughout, the text has been revised with the aim of 
improving clarity and bringing the applications up to date. It has been 
tested extensively in courses for university students as well as through 
in-plant training of practicing engineers. The authors have found that the 
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material in the book can be covered in a two-semester or three-quarter 
course consisting of three lectures a week. However, through a choice 
of topics the book also lends itself as a text for shorter courses with empha- 
sis upon either theory or applications. 

Chapters 2, 3, 4, and 6 provide a brief, though rigorous, introduction 
to the theory of statistics, and together with some of the material in 
Chapter 15, they are suitable for an introductory semester (or quarter) 
course on the mathematics of probability and statistics. Chapters 5, 7, 8, 9, 
and 10 contain conventional material on the elementary methods of 
statistical inference, including a treatment of nonparametric methods. 
Chapters 11, 12, and 13 comprise an introduction to some of the standard, 
though more advanced, topics of experimental statistics, while Chapters 
14 and 15 deal with special applications which have become increasingly 
important in recent years. 

The mathematical background expected of the reader is a year course 
in calculus; actually, calculus is required mainly for Chapters 4 and 6 
dealing with basic distribution theory in the continuous case, for Chapter 
15 dealing with applications to reliability theory, and for the least-squares 
methods of Chapters 11 and 12. The treatment of probability in Chapter 
2 is modern in the sense that it is based on the elementary theory of sets. 

The authors would like to express their appreciation and indebtedness 
to the D. Van Nostrand Company for permission to reproduce the mate- 
rial in Table 2; to the late Sir Ronald A. Fisher, F.R.S., Cambridge, and 
to Messrs. Oliver and Boyd, Ltd., Edinburgh, for permission to reprint 
Table 4 from their book, Statistical Methods for Research Workers; to 
Professor E.S. Pearson and the Biometrika trustees for permission to 
reproduce the material in Tables 5, 6, and 9; to A. H. Bowker and G. J. 
Lieberman and Prentice-Hall, Inc., for permission to reproduce Table 8, 
to Donald B. Owen and Addison-Wesley, Inc., for permission to reproduce 
part of the Table of Random Numbers from their Handbook of Statistical 
Tables; to Frank J. Massey, Jr., and the Journal of the American Statistical 
Association for permission to reproduce the material in Table 10; to D. B. 
Duncan,,H. L. Harter, and Biometrics for permission to reproduce Table 
12; to the American Society for Testing and Materials for permission to 
reproduce Table 13; and to the McGraw-Hill Book Company for permis- 
sion to reproduce the material in Table 14. 

The authors would also like to express their appreciation to the edi- 
torial staff of Prentice-Hall, Inc., for their courteous cooperation in the 
production of this book, to the various secretaries who helped in the typing 
of the manuscript, and above all to their wives for not complaining too 
much about the demands made on their husbands throughout the writing 


of this book. 
IRWIN MILLER AND JOHN E. FREUND 





Introduction 


.1 MODERN STATISTICS 


The origin of statistics can be traced to two areas of interest which, on the 
surface, have little in common: games of chance and what is now called 
political science. Mid-eighteenth century studies in probability (motivated 
largely by interest in games of chance) led to the mathematical treatment 
of errors of measurement and the theory which now forms the founda- 
tion of statistics. In the same century, interest in the numerical description 
of political units (cities, provinces, counties, etc.) led to what is now 
called descriptive statistics. At first, descriptive statistics consisted merely 
of the presentation of data in tables and charts; nowadays, it includes 
also the summarization of data by means of numerical descriptions. 

In recent decades, the growth of statistics has made itself felt in almost 
every major phase of human activity, and the most important feature of 


1 


2 Ch.1 Introduction 


its growth has been the shift in emphasis from descriptive statistics to the 
methods of statistical inference, or inductive statistics. Statistical inference 
concerns generalizations based on sample data; it applies to such problems 
as estimating an engine’s average emission of pollutants on the basis of 
some trial runs, testing a manufacturer’s claim on the basis of measure- 
ments performed on samples of his product, and predicting the fidelity 
of an audio system on the basis of sample data pertaining to the perfor- 
mance of its components. 

When one makes a statistical inference, namely, an inference which 
goes beyond the information contained in a set of data, one must always 
proceed with caution. One must decide carefully how far one can go in 
generalizing from a given set of data, whether such generalizations are 
at all reasonable or justifiable, whether it might be wise to wait until there 
are more data, and so forth. Indeed, some of the most important prob- 
lems of statistical inference concern the appraisal of the risks and the 
consequences to which one might be exposed by making generalizations 
from sample data. This includes an appraisal of the probabilities of 
making wrong decisions, the chances of making incorrect predictions, 
and the possibility of obtaining estimates which do not lie within per- 
missible limits. 

In recent years, attempts have been made to treat all these problems 
within the framework of a unified theory called decision theory. Although 
this theory has many conceptual as well as theoretical advantages, it poses 
some problems of application that are difficult to overcome. To understand 
these problems, one must realize that no matter how objectively an experi- 
mentor an investigation is planned, it is impossible to eliminate all elements 
of subjectivity. It is at least partly a subjective decision whether to base an 
experiment (say, the determination of a specific heat) on 5 measurements, 
on 12 measurements, or on 25 or more. Also, subjective factors invariably 
enter the design of equipment, the hiring of personnel, and even one’s 
deciding how to formulate a hypothesis and the alternative against which 
it is to be tested. (This important problem will be discussed in some detail 
in Chapter 7.) An element of subjectivity enters even when we define such 
terms as “good” or “best” in connection with the description of decision 
criteria—for instance, in Chapter 11 we shall ask for a straight line which 
“best” fits a given set of paired data. Above all, subjective judgments are 
virtually unavoidable when one is asked to put “cash values” on the 
various risks to which one is exposed. In other words, it is impossible to be 
completely objective in specifying “rewards” for being right (or close) and 
“penalties” for being wrong (or not close enough). After all, if a scientist 
is asked to judge the safety of a piece of equipment, how can he possibly 
put a cash value on the possibility that he might make an error, when such 
an error may lead to the loss of human lives? 
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Whether or not statistical inference is viewed within the broader 
framework of decision theory, it depends heavily on the theory of prob- 
ability. This is a mathematical theory, but the question of objectivity vs. 
subjectivity arises in its application and in its interpretation. As we shall 
see in Chapter 2, statements such as “there is a fifty-fifty chance that 
a given fabric will ignite when exposed to a cigarette ash” or “the prob- 
ability is 0.05 that a defective part will not be caught in the final inspection” 
can be looked upon as objective or subjective evaluations of the corre- 
sponding uncertainties. 

We shall approach the subject of statistics as a science, developing 
each statistical idea insofar as possible from its probabilistic foundation, 
and applying each idea to problems of physical or engineering science as 
soon as it has been developed. The great majority of the methods we 
shall use in stating and solving these problems belong to the classical 
approach, because they do not formally take into account the various 
subjective factors mentioned above; in selected applications we present 
also the Bayesian approach (see Sections 7.3 and 9.2) which accounts 
formally for at least some of these subjective factors. In any case, we shall 
endeavor continually to make the reader aware that the subjective factors 
do exist, and to indicate whenever possible what role they might play in 
making the final decision. Subjectivity plays an important role in the 
choice among statistical methods or formulas to be used in a given 
situation, in deciding on the size of a sample, in specifying the prob- 
abilities with which we are willing to risk errors, and so forth. This 
“bread-and-butter” approach to statistics presents the subject in the 
form in which it has so successfully contributed to engineering science, 
as well as to the natural and social sciences, in the last twenty years. 


1.2 STATISTICS AND ENGINEERING 


There are few areas where the impact of the recent growth of statistics 
has been felt more strongly than in engineering and industrial manage- 
ment. Indeed, it would be difficult to overestimate the contributions 
statistics has made to problems of production, to the effective use of 
materials and manpower, to basic research, and to the development 
of new products. As in the other sciences, statistics has become a vital tool 
to the engineer; in fact, some knowledge of statistics has become a neces- 
sity without which he cannot possibly appreciate, understand, or apply 
much of the work done in his field. 

In this text, our attention will be directed largely towards engineering 
applications, but we shall not hesitate to refer also to other areas, to 
impress upon the reader the great generality of most statistical techniques. 
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Thus, the reader will find that the statistical method which is used to 
estimate the coefficient of thermal expansion of a metal serves also to 
estimate the average time it takes a secretary to perform a given task, the 
average thickness of a pelican egg, or the average I.Q. of an immigrant 
arriving in the United States. Similarly, the statistical method which is 
used to compare the strength of two alloys serves also to compare the 
effectiveness of two teaching methods, the merits of two insect sprays, 
or the performance of men and women in a current-events test. 

In spite of the generality of most statistical techniques, there are also 
instances where the requirements of different fields have led to the develop- 
ment of special techniques. Thus, problems of economic forecasting led to 
special methods used in the analysis of series of business data; the phar- 
macological problem of determining critical dosages led to what is called 
probit analysis; problems in psychological testing led to factor analysis, 
and so forth. So far as engineering is concerned, we shall introduce the 
reader to three areas which have required the development of special 
techniques. In Chapter 14 we shall present some of the special methods 
used in problems of quality assurance, including the problem of controlling 
(establishing and maintaining) quality in mass production, the problem 
of establishing tolerance limits, and problems of sampling inspection. 
Then, in Chapter 15 we shall present special techniques which have been 
developed to meet the reliability needs of the highly complex products of 
space-age technology. The third area, operations research, is a new tech- 
nology which is characterized by the application of scientific techniques 
(including probability theory and statistics) to problems involving the 
operations of a “system” looked upon as a whole. Thus, it applies to the 
conduct of a war, the management of a firm, the manufacture of a product, 
and so forth. This new technology will not be presented in a special 
chapter, but some of its methods will be illustrated throughout the text 
in the exercises and examples. 





Probability 






C ‘ SAMPLE SPACES 

Inv statistics, a set of all the possible outcomes of an experiment is called 
a sample space, and it owes its name to the fact that in most instances it 
consists of all the things that can happen when one takes a sample. 
Sample spaces are usually denoted by the letter S. To avoid misunder- 
standings about the words “experiment” and “outcome” as they were 
used above, it should be understood that statisticians use these terms 
in a very wide sense. An experiment may consist of the simple process of 
noting whether a switch is turned on or off; it may consist of determining 
what proportion of a sample of patients shows an adverse reaction to a new 
medication; or it may consist of the very complicated process of finding 
the mass of an electron. Correspondingly, an outcome of an experiment 
may be a simple choice between two alternatives; it may be the result of 
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a measurement or count; or it may be an answer obtained after extensive 
measurements and calculations. 

When dealing with problems in which uncertainties are connected with 
the various outcomes, it is convenient to think of the outcomes of an 
experiment, the elements of the sample space, as points in a space of one 
or more dimensions. 


Example. Suppose that a government agency must decide where to 
locate two new atomic power plants, and that (for a certain purpose) 
it is of interest only how many of these plants will be located in 
Texas. The possible outcomes are 0, 1, and 2, and the sample space may 
be pictured as in Figure 2.1. If it were of interest to indicate also 
how many of the new power plants will be located in California, the 
possible outcomes could be denoted (0, 0), (1, 0), (0, 1), (2, 0), C1, 1), 
and (0, 2), where the first coordinate pertains to the number of new 
power plants to be located in Texas and the second coordinate pertains 
to the number of new power plants to be located in California. The 
corresponding sample space may be pictured as in Figure 2.2. 





Figure 2.1. Sample space for the number of new atomic power plants to 
be located in Texas. | 


In the second part of this example, the six possible outcomes could have 
been represented by means of six points on a line, or six points in any 
kind of pattern, but the use of coordinates and a corresponding number 
of dimensions has the advantage that it makes it easy to identify each 
outcome with the corresponding point. 

Generally, sample spaces are classified according to the number of 
elements (points) which they contain. Thus, the sample spaces of Figures 
2.1 and 2.2 have 3 and 6 elements, respectively, and they are both referred 
to as finite. Other examples of finite sample spaces are the one used to 
represent the various ways in which a president and a vice-president can 
be elected from among the 25 members of a union local (it has 600 ele- 
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Figure 2.2. Sample space for the number of new atomic power plants to be 
located in Texas and in California. 


ments); and the one used to represent the various ways in which a student 
can answer the 12 questions on a true-false test (the answer is 2!2 = 
4,096). The following are examples of sample spaces that are not finite: 


Example. Suppose that a person checks the nitrogen-oxide emission 
of automobiles and that we are interested in the number of cars he has 
to inspect before he observes the first one that does not meet govern- 
ment regulations. It could be the first, the second, ..., the fiftieth, 
..., and for all we know he may have to check a million or more 
cars before he finds one that does not meet government regulations. 
Not knowing how far we might have to go, it is appropriate in an 
example like this to consider the sample space to be the whole set of 
natural numbers, of which there is a countable infinity. 

To go one step further, if we were interested in the nitrogen-oxide 
emission of a given car in grams per mile, the sample space would have 
to consist of all the points on a continuous scale (a certain interval 
on the line of real numbers), of which there is a continuum. 


In general, a sample space is said to be discrete if it has finitely many or 
a countable infinity of elements. If the elements (points) of a sample 
space constitute a continuum, for example, all the points on a line, all 
the points on a line segment, all the points in a plane, and so forth, the 


8 Ch. 2 Probability 


Sy 


sample space is said to be continuous. In the remainder of this chapter 
we shall consider only discrete and mainly finite sample spaces. 

At times it can be quite difficult, or at least tedious, to determine the 
number of elements in a finite sample space by direct enumeration. The 
following illustrates a method that can often be used to simplify this 
task: | 


Example. A consumer testing service rates lawn mowers according 
to their ease of operation, retail price, and average cost of repairs. 
If E,, E,, and £, represent a lawn mower’s being easy, average, or 
difficult to operate, P, and P, represent its being expensive or inexpen- 
sive, and C,, C,, and C; represent its average cost of repairs being 
high, average, or low, the various ways in which a lawn mower can 
thus be rated may be visualized by means of the tree diagram of Figure 
2.3. Following a given path (from left to right) along the branches 





Figure 2.3. Tree diagram for ratings of lawn mowers. 





2.2 Events 


of the “tree,” we obtain a particular rating, namely, a particular ele- 
ment of the sample space, and it can be seen that altogether there 
are 18 possibilities. 


This result could also have been obtained by observing that there are three 
“E-branches,” that each E-branch forks into two “P-branches,” and that 
each P-branch forks into three “C-branches.” Thus, there are 3-2-3 = 18 
combinations of branches, or paths. This result is generalized by the 
following theorem: 





THEOREM 2.1. If sets A,;, Az,..., A, contain, respectively n,, Np, 
..., 1; elements, there are n,+n,*... °n, ways of selecting first an 
element from A,, then an element from A,,..., and finally an ele- 
ment from A,. 


This theorem can be verified by constructing a tree diagram similar to 
that of Figure 2.3, where we had n, = 3, n, = 2, n; = 3, and, hence, 
3-2-3 = 18 possibilities. 


Example. To illustrate the use of this theorem, consider again the two 
finite sample spaces referred to on pages 6 and 7. A union local with a 
membership of 25 can elect a president in 25 ways; subsequently it 
can elect a vice-president in 24 ways, and the whole selection can be 
made in 25-24 = 600 ways. So far as the true-false test with 12 
questions is concerned, each question can be answered in two ways 
so that there are altogether 2+2+2+2+2+2+2+2+2+2+2-2 = 2!2 = 
4,096 ways in which a student can answer the 12 questions of the test. 


Se # 
Probabilities are always associated with the occurrence or the nonoc- 


currence of events, such as the event that 8 of 20 road building jobs are 
finished on time, the event that a vacuum cleaner will not exceed a guar- 
anteed maximum level of sound, the event that we get 0 heads in 4 flips 
of a coin, the event that in one week there will be at least four industrial 
accidents in a given plant, and so forth. Thus, in connection with prob- 
abilities, we always refer to an individual outcome or to a set of outcomes 
of an experiment as an event. (The event of getting 0 heads in 4 flips 
of a coin pertained to an individual outcome while the event of there 
being at least four industrial accidents pertains to a set of outcomes.) 
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In other words, we shall think of an event as a subset of an appropriate 
sample space. 


Example. To illustrate this in more detail, suppose that the laboratory 
of an engineering firm has to check the suitability of 4 solid crystal 
lasers and 3 carbon dioxide lasers for a given task. If we are interested 
only in how many of each type are suitable (and not which ones in 
particular), the number of elements in the sample space is 5-4 = 20 
and the corresponding two-dimensional sample space may be pictured 
as in Figure 2.4. 





Figure 2.4. Events in a sample space. 


Now suppose we let R stand for the event that only 1 of all the 
lasers is suitable for the given task, we let T stand for the event that 
exactly 3 of the lasers are suitable for the given task, and we let U 
stand for the event that more of the solid crystal lasers than the 
carbon dioxide lasers are suitable for the given task. The elements 
of the sample space which correspond to these events are indicated, 
respectively, by means of the solid, dotted, and dashed curves of 
Figure 2.4. Note that events R and T have no elements in common— 
they are mutually exclusive events. 


In many problems of probability we are interested in events which are 
actually combinations of two or more events, formed by taking unions, 
intersections, and complements. Although the reader must surely be famil- 
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iar with these terms, let us review briefly that if A and B are any two sets 
in a sample space S, their union A U B is the subset of S which contains 
all elements that are either in A, in B, or in both; their intersection A (\ B 
is the subset of S which contains all elements that are in both A and B; 
and the complement A' of A is the subset of S which contains all the 
elements of S that are not in A. 


Continuation of Example. From Figure 2.4 it can be seen that RU T 
contains the elements (0, 1), (0,3), (1,0), (1, 2), (2, 1), and (3, 0), 
while R \ T has no elements at all. Following the practice of denoting 
the empty set, or null set, by the symbol @, we can write R 1 T= @, 
and this is the mathematical way of saying that R and T are mutually 
exclusive. Finally, U’ contains the elements (0, 0), (0, 1), (1, 1), (0, 2), 
(1, 2), (2, 2), (0, 3), (1, 3), (2, 3), and (3, 3), where in each case the 
number of solid crystal lasers that are suitable for the given task is 
less than or equal to the corresponding number of carbon dioxide 
lasers. Observe also that the complement of S with respect to itself 
is the null set, that is S’= @. 


Sample spaces and events, particularly relationships among events, 
are often depicted by means of Venn diagrams like those of Figures 2.5 





Figure 2.5. Venn diagrams. 
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. Figure 2.6. Use of Venn diagrams to verify that (A U B)’ = A’ - B’. 


and 2.6. In each case the sample space is represented by a rectangle, while 
events are represented by regions within the rectangle, usually by circles 
or parts of circles. Thus, the shaded regions of the four Venn diagrams 
of Figure 2.5 represent, respectively, the event A, the complement of 4, 
the union of A and B, and the intersection of A and B. 


Example. If A is the event that a certain student is taking a course in 
calculus and B is the event that she is taking a course in applied 
mechanics, then the region shaded in the first diagram of Figure 2.5 
represents the event that the student is taking a course in calculus, 
the region shaded in the second diagram represents the event that she is 
not taking a course in calculus, the region shaded in the third diagram 
represents the event that she is taking calculus and/or applied mechan- 
ics, and the region shaded in the fourth diagram represents the event 
that she is taking calculus as well as applied mechanics. 


Venn diagrams are often used to verify relationships among sets, thus 
making it unnecessary to give formal proofs based on the algebra of sets. 
To illustrate, let us demonstrate that (A U B)’ = A’ ~ B’, which expresses 
the fact that the complement of the union of two sets equals the inter- 
section of their respective complements. To begin, note that the shaded 
region of the first Venn diagram of Figure 2.6 represents the set (A U B)’ 
(compare this diagram with the third diagram of Figure 2.5). The cross- 
hatched region of the second Venn diagram of Figure 2.6 was obtained by 
shading the region representing A’ with lines going in one direction and 
that representing B’ with lines going in another direction. Thus, the 
crosshatched region represents the intersection of A’ and B’, and it can 
be seen that it is identical with the shaded region of the first Venn diagram 
of Figure 2.6. 
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EXERCISES 


JD A building inspector has to check the wiring in a new apartment building 
either on Monday, Tuesday, Wednesday, or Thursday, and at 8 A.M., 1 A.M., 
or 2 P.M. Draw a tree diagram which shows the various ways in which he can 
schedule the inspection of the wiring of the new apartment building. 


ff 

(2) If the five finalists in the Miss Universe contest are Miss Spain, Miss U.S.A., 
Miss Uruguay, Miss Portugal, and Miss Japan, draw a tree diagram which 
shows the various ways in which the judges can choose the winner and the 
first runner-up. 


3. A student can study either 0, 1, or 2 hours for a test in computer program- 
ming on any given night. Construct a tree diagram to show that there are 10 
different ways in which he can study altogether 6 hours for the test on four 
consecutive nights. 


4. If the tone quality of five audio systems is to be rated superior, average, or 
inferior and we are interested only in how many of the systems get each of 
these ratings, draw a tree diagram which shows the 21 different possibilities. 


5. Suppose that in a baseball World Series (in which the winner is the first 
team to win four games) the National League champion leads the American 
League champion two games to one. Construct a tree diagram which shows 
the number of ways in which these teams can win or lose the remaining 
games. 


6. In an optics kit there are six concave lenses, four convex lenses, and three 
prisms. In how many ways can one choose one of the concave lenses, one of 
the convex lenses, and one of the prisms ? 


(7) A questionnaire sent through the mail as part of a market study consists of 
eight questions, each of which can be answered in three different ways. In 
how many different ways can a person answer the eight questions on this 
questionnaire ? 


8. In each of the following experiments decide whether it would be appropriate 

to use a sample space which is finite, countably infinite, or continuous: 

(a) The amount of cosmic radiation to which passengers are exposed during 
a transcontinental jet flight is measured by means of a suitable counter. 

(b) Five of the members of a professional society with 12,600 members are 
chosen to serve on a nominating committee. 

(c) An experiment is conducted to measure the heat of vaporization of 
water. 

(d) A study is made to determine in how many of 450 airplane accidents the 
main cause is pilot error. 
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(ec) Measurements are made to determine the uranium content of a certain 
ore. 

(f) In a torture test, a watch is dropped from a tall building until it stops 
running. 


Two professors and three graduate assistants are responsible for the super- 
vision of a physics lab, and at least one professor and one graduate student 
has to be present at all times. 

(a) Using two coordinates so that (1, 2), for example, represents the event 
that one of the professors and two of the graduate assistants are present, 
and (2, 3) represents the event that both professors and all three graduate 
assistants are present, draw a diagram (similar to that of Figure 2.4) 
showing the points of the corresponding sample space. 

(b) Describe in words the event which is represented by each of the following 
sets of points of the sample space: the event B which consists of the 
points (1, 3) and (2, 3), the event C which consists of the points (1, 1) 
and (2, 2), and the event D which consists of the points (1, 2) and (2, 1). 

(c) With reference to part (b), list the points of the sample space which 
represent the event C U D, and describe this event in words. 

(d) With reference to part (b) are events Band D mutually exclusive? 


Mr. Jones, a salesman of industrial chemicals, has four customers in Sacra- 
mento, whom he may or may not be able to visit on a two-day visit to this 
city. He will not visit any of these customers more than once. 

(a) Using two coordinates so that (2, 1), for example, represents the event 
that he will visit two of his customers on the first day and one on the 
second day, and (0, 2) represents the event that he will not visit any of 
his customers on the first day but two on the second day, draw a diagram 
(similar to that of Figure 2.4) showing the points of the corresponding 
sample space. 

(b) List the points of the sample space of part (a) which constitute the fol- 
lowing events: event X that he will visit all four of his customers, event 
Y that he will visit more of his customers on the first day than on the 
second day, and event Z that he will visit at least three of his customers 
on the second day. 

(c) With reference to part (b), describe each of the following events in words 
and list the points which they contain: X’, X U Y, XM Z,and Xx’ Y. 

(d) With reference to part (b), check whether any two of the three events 
X, Y, and Z are mutually exclusive. ; 


Suppose that a group of efficiency experts is visiting a large firm and that A 
is the event that they inspect the accounting department, Q is the event that 
they inspect the quality control department, and R is the event that they 
inspect the research department. With reference to the Venn diagram of 
Figure 2.7 list (by numbers) the regions or combinations of regions which 
represent the following events: 

(a) The event that they will visit the accounting and quality control depart- 
ments, but not the research department. 
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Figure 2.7. Venn diagram for Exercises 11 and 12. 


(b) The event that they will visit the accounting department, but neither the 
quality control nor the research department. 

(c) The event that they will visit the accounting and research departments. 

(d) The event that they will visit the quality control department. 

(e) The event that they will visit none of the three departments. 


12. With reference to Exercise 11 and the Venn diagram of Figure 2.7, explain 
in words what events are represented by: 
(a) Region 1; (b) Region 5; 
(c) Regions 1 and 2 together; (d) Regions 3 and 5 together; 
(e) Regions 3, 5, and 6 together; (f) Regions 1, 2, 4, and 7 together. 


13. Use Venn diagrams to verify that 
(a) (AM BY =A’ UB’; (b) AUANB) =A: 
(Cc) (AN BU(AN B)D=A; 
(4) AUB=(AN B)U(AN B)U(A’N B); 
(e) AU(BNC)=(AVUBN(AUC). 


fo 





/).3 PROBABILITY 


In this section we shall define probabilities as the values of set functions, 
additive set functions to be exact. Since the reader is probably most familiar 
with functions for which the elements of the domain and the range are 

all numbers, let us first consider a very simple example where the elements 
of the domain are sets while the elements of the range are real numbers. 
Specifically, we shall study a function, a correspondence, which assigns 
real numbers to the subsets of a given set, namely, to the subsets of a 
sample space. 
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Example. The set function which we shall consider is the one which 
assigns to each subset A of a given finite set S the number of elements 
in A, written (A). Suppose, then, that 500 machine parts are inspected 
before they are being shipped, that 7 denotes that a machine part is 
improperly assembled, and D denotes that it contains one or more 
defective components. The distribution of these 500 machine parts 
among the various categories is as shown in the Venn diagram of 
Figure 2.8 and, using this diagram, we can now determine the value 





Figure 2.8. Classification of the 500 machine parts. 


N(A) for any one of the 16 subsets of the sample space. (To see that 
there are 16 subsets, consider the four regions into which the circles 
of Figure 2.8 divide the sample space. We must decide whether or 
not each region is to be included in any given subset; hence, there are 
2-2-2-2 = 16 possibilities, including the set S of all 500 machine parts 
and the empty set @.) The numbers in Figure 2.8 are the number of. 
machine parts that are improperly assembled but do not have any 
defective components, the number of machine parts that are improperly 
assembled and have one or more defective components, the number 
of machine parts that are properly assembled but have one or more 
defective components, and the number of machine parts that are 
properly assembled and do not have any defective components. 
Symbolically, 


NI D’) = 20, NI D) = 10, NO" CO). D) = d; 
and NU’ D’) = 465 


To find the total number of machine parts that are improperly assem- 
bled, we have only to add the number of improperly assembled 
machine parts that have one or more defective components to the 
number of improperly assembled machine parts which do not have 
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any defective components, and we obtain 
NI) = NIN D)+ NIA D’) = 10 + 20 = 30 


Similarly, we find that the total number of machine parts with one 
or more defective components is 


N(D) = NIN D)+ NU’ A D)=10+5=15 


Finally, since N(S) = 500, where S is the set of all 500 machine parts, 
we obtain by subtraction N(I’) = N(S) — N(I) = 500 — 30 = 470 
and N(D') = N(S) — N(D) = 500 — 15 = 485. 


The set function which we have introduced in this example is said to 
be additive, meaning that the number which it assigns to the union of two 
subsets which have no elements in common equals the sum of the numbers 
assigned to the individual subsets.* It was this property which enabled 
us to calculate N(Z) by adding NUM D) and NUM D’), N(D) by adding 
NWO D) and NU’ D), NJ’) by subtracting N() from N(S), and 
N(D") by subtracting N(D) from N(S). Note that this property applies 
only to subsets which have no elements in common, namely, to subsets 
which are disjoint; as we indicated on page 10, events that correspond 
to disjoint subsets are called mutually exclusive events. 

When dealing with two subsets A and B which may have some ele- 
ments in common, we must use the more general addition formula 





N(A U B) = N(A) + M(B) — N(A 2 B) 





where we subtracted N(A ~ B) because these elements are counted once 
in N(A) and once in N(B). 


Continuation of Example. For our illustration we thus get 
NWI U D) = NZ) + MD) — NWN D) 
= 30+ 15 — 10 
e= 35 
for the number of machine parts that are improperly assembled, have 


one or more defective components, or both. 


*A simple example of a set function which is not additive is the one which assigns to 
every subset the square of the number of elements which it contains. 
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Using the concept of an additive set function, let us now explain what 
we mean by the probability of an event. Given a finite sample space S 
and an event A in S, we define P(A), the probability of A, to bea value 
of an additive set function P, which must satisfy the following three 
conditions: 


SO 


Axiom 1. 0 < P(A) <1 for each event A in S. 
AXIOM 2. P(S) = 1. 


Axiom 3. If A and B are any mutually exclusive events in S, then 
P(A U B) = P(A) + PCB). 





We refer to such a function P as a probability function, and it should be 
noted that the first axiom states that a probability function assigns to 
every event A in S some real number from 0 to 1, inclusive. The second 
axiom states that the sample space as a whole is assigned the number 
1 and it expresses the idea that the probability of a certain event, an event 
which must happen, is equal to 1. The third axiom states that the prob- 
ability function must be additive, and with the use of mathematical 
induction it can be extended to include any finite number of mutually 
exclusive events. In other words, it can be shown that 





P(A, U A, U...U A,) = P(Ay) + P(Ax) +... + PCA,) 





where A,, A>,..., A, are mutually exclusive events in S. (In Section 3.3 
we shall explain how the third axiom will have to be modified so that we 
can consider also sample spaces which are not finite.) 

Axioms for a mathematical theory require no proof, but if such a 
theory is to be applied to the physical world, we must show somehow that 
the axioms are “realistic”; that is, we must show that they yield reasonable 
results. To do so with regard to the axioms of probability, let us say a few 
words about two of the most popular concepts of probability: the fre- 
quency interpretation and subjective probabilities. According to the widely- 
held frequency interpretation, a probability is looked upon as a relative 
frequency, or proportion, in the long run. 


Example. If we say “the probability that a jet from New York to Los 
Angeles will arrive on time is 0.82,” we mean that if present conditions 
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prevail, 82 percent of such flights will arrive on time. Also, if we say 
“the probability is 0.98 that a ceramic insulator will be able to with- 
stand a certain thermal shock,” we mean that 98 percent of the insula- 
tors should withstand such a thermal shock. Note, however, that if 
we say “the probability of getting heads with a balanced coin is 0.50,” 
this does not mean that we must necessarily get 10 heads and 10 tails 
in 20 flips of the coin or 50 heads and 50 tails in 100 flips; it means 
that if the coin is flipped a large number of times we should get close 
to 50 percent heads and 50 percent tails. 


To demonstrate that the three axioms of probability are consistent 
with the frequency interpretation, we need only observe that the propor- 
tion of the time an event occurs cannot be negative or exceed 1 and that 
one outcome or another must occur 100 percent of the time, that is, with 
a probability of 1. Also, if A and B are mutually exclusive events, the 
proportion of the time that either one or the other occurs is the sum of 
the proportions of the time that they occur. 


Example. If the proportion of voters favoring a piece of legislation 
is 0.42 and the proportion undecided is 0.19, then 0.42 + 0.19 = 0.61 
is the proportion of the voters who are either for the legislation or 
undecided. 


A point of view, which is currently gaining favor, is to interpret prob- 
abilities as personal or subjective evaluations. Such probabilities express 
the strength of one’s belief with regard to the uncertainties that are 
involved, and they apply especially when there is very little direct evidence, 
so that there may really be no choice but to consider collateral (indirect) 
information, educated guesses, and perhaps intuition and other subjective 
factors. Subjective probabilities are best determined by referring to 
a risk-taking situation and putting the issue on a “put up or shut up” 
basis. 


Example. For instance, to determine how a businessman may feel 
about the success of a new venture (say, the marketing of some new 
power tools), we ask him for the odds at which he would be willing 
to bet (or consider it fair to bet) that the new venture will be a success. 
If he “feels” that the proper odds are, say, 3 to 2, this means that it 
would be fair to bet $300 against $200, or perhaps $1,500 against 
$1,000, that the new venture will be a success. 


To convert such odds into the corresponding probability, we make use 
of the rule that if somebody considers it fair to bet a dollars against b dollars 
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that a given event will occur, he or she is, in fact, assigning the event the 
a 


a+b 





probability 


Continuation of Example. Thus, the businessman who considers it 

fair to bet $300 against $200 that the new power tools will be a com- 

mercial success, is actually assigning this event the subjective prob- 
es 300 7 

ability 300 200 = 0.60. 

So far as the three axioms of probability are concerned, the first one is 
a 
a+b 
exceed 1 so long as a and Bb are positive amounts bet for and against 
an event. With regard to the second axiom we can say that we should 
be willing to give “better” and “better” odds when we become more and 
more certain that an event will occur—say, 100 to 1, 1,000 to 1, or perhaps 

100 


consistent with the subjective approach since cannot be negative or 





even 1,000,000 to 1. The corresponding probabilities are 100-1 
1,000 1,000,000 ; 
000 1’ and 7,000,000 + 1 (or approximately 0.99, 0.999, and 


0.999999). Thus, the more certain we are that an event will occur, the closer 
its probability will be to 1. The third postulate of probability is not neces- 
sarily satisfied by subjective probabilities, as is illustrated by the following 
example: 


Example. Asked about the chances that a mining operation will 
be a great success or a modest success, a petroleum engineer “feels” 
that the odds are 1 to 5 that the operation will be a great success (or 
5 to 1 that it will not be a great success), 2 to 1 that it will be a modest 
success, and 3 to 1 that it will be either a modest success or a great 
J 2 2 


ies ] 
success. The corresponding probabilities are fs Soi S 


3 } ; , l Z 
and zi a and, as can easily be verified, 74 + z does not 
equal = 

4 


Thus, proponents of the subjective approach to probability impose the 
third axiom of probability as a consistency criterion. This means that if 
a person’s subjective probabilities “behave” in accordance with the 
third axiom of probability, he is said to be consistent; otherwise, he is 
said to be inconsistent and his probability judgments must be taken with 
a grain of salt. 
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Before we give some examples of useful probability functions, it is 
important to stress the point that the three axioms do not tell us how to 
assign probabilities to the various outcomes of an experiment, they merely 
restrict the ways in which it can be done. In actual practice, probabilities 
are assigned either on the basis of estimates obtained from past experi- 
ence, on the basis of a careful analysis of conditions underlying the 
experiment, on the basis of assumptions—say, the common assumption 
that various outcomes are equiprobable, or on the basis of subjective 
evaluation. 


Example. The following are three examples of permissible ways of 
assigning probabilities in an experiment where there are three possible 
and mutually exclusive outcomes A, B, and C: 


(1) P(A) = 4, P(B) = 4, P(C)=4 

(2) P(A) = 0.57, P(B) = 0.24, P(G) =019 
(3) P(A) = $4, P(B) = 3, P(C) = x 
However, 

(4) P(A) = 0.64, P(B) = 0,38, P(C) = —0.02 
and 

(5) P(A) = 0.35, PUB) = 0.52, P(C) = 0.26 


are not permissible because (4) violates Axiom 1 and (5) violates 
Axiom 2. 


As it can be shown that a sample space of n points (outcomes) has 2” 
subsets, it would seem that the problem of specifying a probability func- 
tion (namely, a probability for each subset or event) can easily become 
very tedious. Indeed, for m = 20 there are already more than a million 
possible events. Fortunately, this task can be simplified considerably by 
the use of the following theorem: 


THEOREM 2.2. If A is an event in the finite sample space S, then 
P(A) equals the sum of the probabilities of the individual outcomes 
comprising A. 
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To prove this theorem, let E,, F,,..., E, be the m outcomes comprising 
A, so that we can write 4d = E, U FE, U...U E,. Since the E’s are indi- 
vidual outcomes they are mutually exclusive, and by the extension of Axiom 
3 on page 18 we have 

P(A) = P(E, tC) Eek a 


= P(E,) + P(E.) +... + P,) 


which completes the proof. 


Example. To illustrate the use of this theorem, let us return to the 
rating of lawn mowers in the example on page 8, and let us suppose 
that the probabilities of the 18 different ratings are as shown in 
Figure 2.9 (which, except for the probabilities, is identical with Figure 





Figure 2.9. Ratings of lawn mowers and their probabilities. 
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2.3 on page 8). From this figure, we determine P(E,), the prob- 
ability that a lawn mower will be rated easy to operate by adding 
the probabilities of the six outcomes comprising the event £,, getting 


P(E,) = 0.07 + 0.13 + 0.06 + 0.05 + 0.07 + 0.02 = 0.40 


Similarly, we find that 


P(P,) = 0.07 + 0.13 + 0.06 + 0.07 + 0.14 
+ 0.07 + 0.02 + 0.03 + 0.01 = 0.60 
P(C,) = 0.07 + 0.05 + 0.07 + 0.08 + 0.02 + 0.01 = 0.30 
P(E, O P,;) = 0.07 + 0.13 + 0.06 = 0.26 
and 
P(E, O C,) = 0.07 + 0.05 = 0.12 


where the last value is the probability that a lawn mower will be rated 
easy to operate but having a high average cost of repairs. 


Example. To consider another illustration, let us refer again to the 
example on page 16, which dealt with the 500 machine parts, and 
let us suppose that one of the machine parts is to be chosen by lot for 
inspection. Assuming that each machine part, thus, has a probability 
of =4, of being selected, it can easily be verified by using Theorem 2.2 
that the probability that the machine part is improperly assembled 
is PZ) = #5, the probability that it contains one or more defective 
components is P(D) = zg5, and the probability that it is improperly 
assembled, contains one or more defective components, or both is 
PU U D) = 7h. (Using Theorem 2.2, each of these probabilities 
was obtained by adding ={y as many times as there are individual 
outcomes in the respective events.) 


In this last example we could have used to advantage the following 
theorem which applies to experiments where the individual outcomes are 
all equiprobable: 


THEOREM 2.3. Jf an experiment has n possible outcomes which are 
equiprobable and if s of these are labeled “success,” then the prob- 
ability of a “success” is s/n. 


This theorem follows immediately from Theorem 2.2 and the generaliza- 
tion of the third axiom of probability given on page 18. 
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Continuation of Example. We could, thus, have found directly that 
the probability of selecting a machine part which is improperly 
assembled is 
a: C) ee eee 
NS) 500 50 


and that the probability of selecting a machine part which is impro- 
perly assembled and also has one or more defective components is 


—_NUOD) 10 1 

PUI D) =Sis) = 300 = 30 

Theorem 2.3 is particularly useful in problems dealing with games of 
chance, where it is assumed that if a deck of cards is properly shuffled each 
card has the same chance of being selected, if a coin is properly flipped 
heads is as likely as tails, if a die is properly rolled each face is as likely to 
come up as any other, and so on. Thus, Theorem 2.3 applies also when 
selections are made by means of lots or other gambling devices to decide, 
say, which items coming off an assembly line are to be inspected, which 
test cars are to be assigned to each of four drivers, which plots of farm 
land are to be used to try a special variety of wheat, which coal-fired 
generating plants in a geographical area are to be used to experiment with 
antipollution devices, and so forth. 





2.4 SOME ELEMENTARY THEOREMS 


Using the axioms of probability, it is possible to derive many theorems 
which play an important role in applications. First let us show that 


THEOREM 2.4. If A is any event in S, then P(A’) = 1 — P(A). 





which, in accordance with the frequency interpretation, expresses the 
fact that if 34 percent of the shipments from a vendor arrive on time, 
then 66 percent of the shipments do not arrive on time, or if 16 percent 
of all applicants fail a certain civil service test, then 84 percent do not 
fail the test. So far as subjective probabilities are concerned, Theorem 2.4 
expresses the fact that if the odds for the occurrence of an event are 8 to 
1, then the odds against its occurrence are 1 to 8; the corresponding 


: and : and — l mi To prove 


ore 8 
probabilities are ee ar Teo a : 
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Theorem 2.4, observe that A and A’ are mutually exclusive by definition, 
and that A U A’ = S (that is, among them, 4 and JA’ contain all of the 
elements of S). Hence, we have 


P(A U A’) = P(A) + P(A’ 


according to Axiom 3, 


PA WA} = PS) = 1 
according to Axiom 2, and it follows that 
P(A) + P(A’) = 1 


which completes the proof of Theorem 2.4. As a special case we find that 
P(@) = 1 — P(S) = 1—1=0, since the empty set @ is the comple- 
ment of S. 


Example. Referring again to the lawn mowers and the results on page 
23, we thus find that the probability that a lawn mower will not be 
rated easy to operate is 1 — P(E,) = 1 — 0.40 = 0.60, and that the 
probability that a lawn mower will be rated as either not being easy to 
operate or not having a high average cost of repairs is 1 — P(E, \ C;) 
=]— 0.12 = 0.88. 


On page 18 we observed that Axiom 3 can be extended to include 
more than two mutually exclusive events. Another useful and important 
extension of this axiom allows us to find the probability of the union of 
any two events in S regardless of whether they are mutually exclusive. 
To motivate the theorem which follows, let us consider again the example 
on page 17, the one dealing with the 500 machine parts, where we 
obtained the number of machine parts that are improperly assembled, 
have one or more defective components, or both, by writing 


NZI U D) = NI) + N(D) — NWA D) 
= 30+ 15— 10 
= 35 


As we pointed out in connection with the general formula for N(A U B) 
on page 17, we had to subtract NU ™ D), the number of machine parts 
that are improperly assembled and also contain one or more defective 
components, because they would otherwise be counted twice, once in 
N(I) and once in N(D). If we divide each of the above figures by 500, the 
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total number of machine parts, we could similarly have argued that the 
proportion of machine parts that are improperly assembled, have one or 
more defective components, or both, equals the proportion of machine 
parts that are improperly assembled plus the proportion of machine parts 
that contain one or more defective components minus the proportion of 
machine parts that fit both of these descriptions. In line with this intuitive 
justification, let us now prove the following theorem, which is usually 
called the general law of addition: 


THEOREM 2.5. Jf A and B are any events in S, then 


P(A U B) = P(A) + P(B) — P(A B) 


To prove this theorem, note first from the Venn diagram of Figure 2.10 
that 


AUVUB=(ANBUAN B)U (40 B) 
and also that 


A=(ANM B)U(AN B’) 
B=(AN B)U(A' OB) 


(The reader was asked to verify these important relations in parts (c) and 
(d) of Exercise 13 on page 15.) Since A 1 B, AM B’, and A’ B are 
evidently mutually exclusive, the extension of Axiom 3 on page 18 yields 


P(AU B)=P(AN B+ PAN B)+4+ P(A’ OC B) 





Figure 2.10. Diagram for proof of Theorem 2.5. 
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and after adding and subtracting P(A ~ B) we obtain 


P(A U B)=[P(A 0 B) + P(AN BY + [P(A 2 B) 
+ P(A’ Q B)] — P(A 17 B) 
= P(A) + P(B) — P(A 2 B) 


Note that when A and B are mutually exclusive, Theorem 2.5 reduces to 
Axiom 3, since in that case P(A ~ B) = 0. For this reason, we often refer 
to Axiom 3 as’the special law of addition, whereas Theorem 2.5 is called 
the general law of addition. 


Example. To illustrate the use of this theorem, let us refer again 
to the lawn mower example and let us determine the probability that 
a lawn mower will be rated easy to operate, having a high average cost 
of repairs, or both. Making use of the results obtained on page 23, 
namely, P(E,) = 0.40, P(C,) = 0.30, and P(E, \ C;) = 0.12, we can 
immediately write the answer as 


P(E; U C,) = 0.40 + 0.30 — 0.12 
= 0.58 


Example. To give another example, suppose that the probabilities are, 
respectively, 0.28, 0.15, and 0.09 that a person traveling through the 
state of Arizona will visit the Grand Canyon, the Petrified Forest, or 
both. Substituting these values into the formula of Theorem 2.5, we 
find that the probability is 


0.28 + 0.15 — 0.09 = 0.34 


that a person traveling through Arizona will visit at least one of these 
scenic attractions. 


EXERCISES 


1, With reference to the example on page 16 and Figure 2.8, list the 16 differ- 
ent subsets that can be formed and the number of elements (machine parts) 
| /@ontained in each. : 


A 2,/In a group of 160 graduate engineering students, 92 are enrolled in an 
“ advanced course in statistics, 63 are enrolled in a course in operations 
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research, and 40 are enrolled in both. How many of these students are not 
enrolled in either course? 


3. The personnel manager of a manufacturing plant claims that among the 400 
employees 312 got a raise in 1975, 248 got increased pension benefits, 173 
got both, and 43 got neither. Draw a Venn diagram and fill in the figures 
which correspond to the various regions to check whether this claim should 

e questioned. 







mong 150 persons interviewed as part of an urban mass transportation 

f study, some live more than three miles from the center of the city (A), some 
now regularly drive their own car to work (B), and some would gladly switch 
to public mass transportation if it were available (C). Use the information 
given in Figure 2.11 to find 


(a) N(A); 57 (b) N(B); 77 

(c) N(C); £5 — (d) N(A 0 B);77 

(:) N(ANC);6 2 (f) NAN BNC); 6 7 
(2) N(AU B); /O7 (h) N(BUC); / 4/ 

(i) N(A’ UB UC); 4/ (j) NIBA (AU CY. SL 





Figure 2.11. Diagram for Exercise 4. 


a ; +05. An experiment has the four possible mutually exclusive outcomes A, B, C, 
and D. Check whether the following assignments of probability are permis- 
sible: 

(a) P(A) = 0.38, P(B) = 0.16, P(C) = 0.11, P(D) = 0.35; 
(b) P(A) = 0.31, P(B) = 0.27, P(C) = 0.28, P(D) = 0.16; 
(c) P(A) = 0.32, P(B) = 0.27, P(C) = —0.06, P(D) = 0.47; 
(d) P(A) = 4, P(B) = 4, P(C) = §, PID) = Zs; 

(ec) P(A) = Pg, P(B) = 4, P(C) = 4, P(D) = §. 


6. Referring to the example on page 10 and the sample space of Figure 2.4, 
suppose that the points (0, 0), (1, 0), (2, 0), (3, 0), (4, 0), (0, 1), C, 1), (2, 1), 
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‘a> C8) 


10. 


11. 


12. 


13. 


(3, 1), (4, 1), (0, 2), (1, 2), (2, 2), (3, 2), (4, 2), (0, 3), C1, 3), (2, 3), (3, 3), and 
(4, 3) of the sample space are assigned the respective probabilities 0.01, 0.01, 
0.01, 0.03, 0.04, 0.01, 0.01, 0.01, 0.03, 0.04, 0.03, 0.03, 0.03, 0.09, 0.09, 0.05, 
0.06, 0.07, 0.14, and 0.21. 

(a) Verify that this assignment of probabilities is permissible. 

(b) Calculate the probabilities of events R, T, and U described on page 10. 
(c) Calculate the probabilities that, respectively, 0, 1, 2, or 3 of the carbon 

dioxide lasers will be suitable for the given task. 


. Referring to the example on page 8 and Figure 2.9, find 


(a) P(E2); (b) P(P2); 
(c) P(C2); (d) P(E, M P2); 
(e) P(E, N C4); (f) PWP2O C2). 


} With reference to Exercise 9 on page 14 suppose that each point (/,/) of 
~ sample space (for i = 1 or 2, and j = 1, 2, or 3) is assigned the probability 
38 

i+7z 

(a) Verify that this assignment of probabilities is permissible. 

(b) Calculate the probabilities of events B, C, and D described in that 
exercise. 

(c) Calculate the probabilities that, respectively, 1, 2, or 3 of the graduate 
students will be supervising the physics lab. 





. Explain how one might conceivably assign a probability to the success of an 


off-shore oil drilling operation, using 
(a) the frequency concept of probability; 
(b) subjective probabilities. 


Discuss whether it is possible for two persons to arrive at correct, yet differ- 
ent, probabilities regarding the commercial success of a new latex paint, if 
they both use 

(a) the frequency concept of probability; 


_(b) subjective probabilities. 


If a student is willing to bet $3 against $1, but not $4 against $1 that he will 
get a passing grade in a certain course, what does this tell us about the sub- 
jective probability he assigns to his getting a passing grade in the course? 


The supplier of delicate optical equipment feels that the odds are 7 to 5 
against a shipment arriving late and 11 to 1 against it not arriving at all. 
Furthermore, he feels that there is a fifty-fifty chance (the odds are 1 to 1) 
that such a shipment will either arrive late or not at all. Are the correspond- 
ing probabilities consistent ? | 


There are two Ferraris in a race, and an expert feels that the odds against 
their winning are, respectively, 2 to 1 and 3 to 1. Furthermore, she claims 
that there is a less-than-even chance that either of the two Ferraris will win. 
Discuss the consistency of these claims. 


X* 52 (i4) 
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When rolling a pair of balanced dice, what are the probabilities of getting 


(a) 7; (b) 11; 
(c) 7or 11; (ad) 2 or 12: 
(e) 3; (f) 2 or 3 or 12? 


9:33 (5) a card is drawn from a well-shuffled deck of 52 playing cards, find the 


eS ? 19, 


16. 


17. 


18. 


. § 20. 


): 


21. 


22. 


probabilities of drawing 


(a) ared king; (b) a black card; 
(c) a 3, 4, 5, or 6; (d) a red ace or a black 7, 
A lottery sells tickets numbered from 00001 through 50000. What is the 


probability that the number drawn will be divisible by 200? 


Referring to Exercise 4 and Figure 2.11, suppose that the questionnaire 
filled in by one of the 150 persons interviewed is to be double-checked. 
Assuming the questionnaire is chosen by lot, find the probabilities that 

(a) the person lives more than three miles from the center of the city; 

(b) the person now regularly drives his or her car to work; 

(c) the person does not live more than three miles from the center of the 
city and would not want to switch to public mass transportation if it 
were available; 

(d) the person now regularly drives his or her car to work but would gladly 
switch to public mass transportation if it were available. 


If A and B are mutually exclusive events, P(A) = 0.23, and P(B) = 0.51, 
find 


(a) P(A’); (b) P(A U B); 
(c) P(A B); (d) P(A’ A B’). 
Given P(A) = 0.35, P(B) = 0.73, and P(A - B) = 0.14, find 
(a) P(A U B); (b) P(A B’); 
(c) P(A’ U BY); (d) P(A’ 0 B). 


The probability that a person stopping at a gas station will ask to have his 

tires checked is 0.12, the probability that he will ask to have his oil checked 

is 0.29, and the probability that he will ask to have them both checked is 

0.07. 

(a) What is the probability that a person stopping at this gas station will 
have either his tires or his oil checked ? 

(b) What is the probability that a person stopping at this gas station will 
have neither his tires nor his oil checked? 


The probability that a new airport will get an award for its design is 0.16, 
the probability that it will get an award for the efficient use of materials is 
0.24, and the probability that it will get both awards is 0.11. 

(a) What is the probability that it will get at least one of the two awards? 
(b) What is the probability that it will get only one of the two awards? 


The probability that at least one of three events A, B, and C will occur is 
given by 
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P(A U BU C) =P(A) + PB) + P(C) — PA B)- PAN C) 
—-PBOAC)+PANBNC) 


Verify this formula with the probabilities shown in Figure 2.12. 





Figure 2.12. Diagram for Exercise 22. 


23. Suppose that in the maintenance of a large medical-records file for insurance 
purposes the probability of an error in processing is 0.0010, the probability 
of an error in filing is 0.0009, the probability of an error in retrieving is 
0.0012, the probability of an error in processing as well as filing is 0.0002, the 
probability of an error in processing as well as retrieving is 0.0003, the prob- 
ability of an error in filing as well as retrieving is 0.0003, and the probability 
of an error in processing, filing, as well as retrieving is 0.0001. What is the 
probability of making at least one of these errors? 


24. Referring to parts (c) and (d) of Exercise 13 on page 15, show that 
(a) P(A) > P(A NZ B); (b) P(A) < P(A U B). 


25. Explain why there must be a mistake in each of the following statements: 

(a) If the probability that an ore contains uranium is 0.28, the probability 
that it does not contain uranium is 0.62. 

(b) A company is working on the construction of two shopping centers; the 
probability that the larger of the two shopping centers will be completed 
on time is 0.35, and the probability that both shopping centers will be 
completed on time is 0.42. 

(c) The probability that a student will get an A in a geology course is 0.32, 
and the probability that he will get either an A ora B is 0.27. 

(d) In a certain city, the probability that a family has at least one color 
television set is 0.82, the probability that a family has at least one black- 
and-white television set is 0.64, and the probability that a family has at 
least one of each kind is 0.41. 
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.) CONDITIONAL PROBABILITY 


As we have defined probability, it is meaningful to ask for the prob- 
ability of an event only if we refer to a given sample space S. To ask for 
the probability that an engineer makes at least $20,000 a year is meaning- 
less unless we specify whether we are referring to all engineers in the 
Western hemisphere, all engineers in the United States, all those in a 
particular industry, all those who are affiliated with a university, and so 
forth. Thus, when we use the symbol P(A) for the probability of A, we 
really mean the probability of A given some sample space S. Since the 
choice of S is by no means always evident, and since there are problems 
in which we are interested in the probabilities of A with respect to more 
sample spaces than one, the notation P(A|S) is used to make it clear 
that we are referring to the particular sample space S. We read P(A| S) 
as “the conditional probability of A relative to S” and every probability 
is, thus, a conditional probability. Of course, we use the simplified nota- 
tion P(A) whenever the choice of S is clearly understood. 


Example. To illustrate some of the ideas connected with conditional 
probabilities, let us consider again the 500 machine parts of which 
some are improperly assembled and some contain one or more 
defective components as shown in Figure 2.8. Assuming equal prob- 
abilities in the selection of one of the machine parts for inspection, 
we saw on page 23 that the probability of getting one with one or 
more defective components is P(D) = a5. Let us now check whether 
this probability changes if the choice is restricted to the machine parts 
that are improperly assembled. To find P(D|/) we have only to look 
at the reduced sample space of Figure 2.13, and to assume that each of 
the 30 improperly assembled machine parts, of which 10 contain one 
or more defective components, has the same probability of being 





Figure 2.13. Reduced sample space. 
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selected. Using Theorem 2.3, we get 


_~NDATI)_ 10_ 1 
Ee Ny = 30 

Thus, the probability of getting a machine part with one or more 
defective components increases markedly on the reduced sample 
space J. Note that if we divide the numerator and the denominator of 
the last expression by N(S), we get 


N(D OI) 
_ NMS) _ PDAD) 
oa cane 0a 
NS) 


and it can be seen that the desired probability is the ratio of the prob- 
ability that the machine part selected contains one or more defective 
components and belongs to the reduced sample space J to the prob- 
ability that it belongs to J. 

Looking at this example in another way, note that with respect to 
the whole sample space S we have 


ty 4 
pon n=Aae = 

MD'AD 2 
PLD 1) = nt =z 


assuming, as before, that each of the 500 machine parts has the same 
probability of being selected for inspection. Thus, the probabilities 
that the machine part selected does or does not contain one or more 
defective components, given that it is improperly assembled, should 
be in the ratio 1: 2. Since all probabilities in the reduced sample space 
must add up to 1, it follows that 


P(D\|I)=4 and P(D'|I) =3 


which agress with the result obtained before. This also explains why 
we had to divide by P(J) when we wrote 


_ PDN!) 
above. Division by P(/), or multiplication by pM takes care of the 


PCI)’ 
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proportionality factor which makes the sum of the probabilities over 
the reduced sample space equal to 1. 


Following these observations, let us now make the following formal 
definition: 


If A and B are any events in S and P(B) ¥ 0, the conditional prob-. 
ability of A relative to B is given by 


P(A ~ B) 


P(A|B) = PCB) 


The following are some immediate applications of this formula: 


Example. If the probability that a communication system will have 
high selectivity is P(A) = 0.54, the probability that it will have high 
fidelity is P(B) = 0.81, and the probability that it will have both is 
P(A ~ B) = 0.18, then the probability that a system with high fidelity 
will also have high selectivity is 


P(A B) _ 0.18 2 
== say = ORL =o 
and the probability that a system with high selectivity will also have 
high fidelity is 


PB 4) = FO 4 = as 


Example. If the probability that a research project will be well planned 
is 0.80 and the probability that it will be well planned and well executed 
is 0.72, then the probability that a research project which is well 


O12 
080 = 0.90. 


Example. Returning to the lawn mower example, for which the 
probabilities of the individual outcomes are given in Figure 2.9 on 
page 22, let us now determine the probability P(E,|C,) that a lawn 
mower will be easy to operate given that it has a high average cost of 
repairs. Using the results obtained on page 23, we get 


planned will also be well executed is —— 


P(E, 1 C,) __ 0.12 


P(C;) 030 Wee 


P(E, |C;) = 
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and it is of interest to note that this equals the value which we obtained 
earlier for P(E,). This means that the probability of a lawn mower’s 
being rated easy to operate is the same regardless of whether or not 
it is given that it has a high average cost of repairs, and we say that 
E, is independent of C,. As the reader will be asked to verify in Exercise 
5 on page 41, it also follows from the results on page 23 that £, is 
not independent of P;, namely, that a lawn mower’s ease of operation 
is related to its retail price. 


In general, if A and B are any two events in a sample space S, we say 
that A is independent of B if and only if P(A|B) = P(A), and since it can 
be shown that B is independent of A whenever A is independent of B, 
namely, that P(A|.B) = P(A) implies P(B| A) = P(B) provided P(A) # 0, 
it is customary to say simply that A and B are independent. 

An immediate consequence of our definition of conditional probability 
is given by the following theorem, usually called the general law of mul- 
tiplication: 


I 


THEOREM 2.6. If A and B are any events in S, then 


P(A B) = P(A):P(B|A) if P(A) #0 
— P(B):P(A|B) if P(B) #0 





The second of these relations follows from the definition of conditional 
probability, multiplying both sides by P(B); the first is obtained by inter- 
changing the letters A and B in the definition of conditional probability 
and then multiplying both sides by P(A). Note that the definition of 
P(B| A) and P(A|B) assumes that P(A) 4 0 and P(B) ¥ 0, respectively. 


Example. To illustrate the use of Theorem 2.6, suppose that the 
foreman of a group of 20 construction workers wants to get the opin- 
ion of two of them (to be selected at random) about certain new safety 
regulations. If 12 of them favor the new regulations while the other 
8 are against them, let us find the probability that both of the workers 
chosen by the foreman will be against the new safety regulations. 
Assuming equal probabilities for each selection, the probability that 
the first construction worker selected will be against the new safety 
regulations is 8, and the probability that the second construction 
worker selected will be against the new safety regulations given that 
the first one is against them is 7. Substituting these values into the 
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formula of Theorem 2.7, we find that the desired probability is 8-4 = 
$¢. Similarly, the probability that both of the construction workers 
will be for the new safety regulations is 42-44 = 33, and, by sub- 
traction, we find that the probability that one of them will be for the 


new safety regulations and one of them will be against them is 1 — 


14 55 2 48 
ie os 3S* 


In the special case where A and B are independent, Theorem 2.6 leads 


to the following, which is usually called the special law of multiplication: 


THEOREM 2.7. If A and B are independent events, then 


P(A A B) = P(A)+P(B) 


Example. For instance, the Se a of getting two heads in two 
successive flips of a coin is 4-4 =4, and the probability of ne 
two aces in a row from a standard deck of 52 playing cards is 4-4 = 
t%9 provided the first card is replaced before the second is drawn. 


The special law of multiplication is readily extended to apply to more than 
two independent events: if three or more events are independent, the prob- 
ability that they will all occur is given by the product of their respective 
probabilities. 


Example. For instance, the probability of getting four heads in a row 
with a balanced coin is 4+4-4-4 = ;4, and the probability of rolling 


a 3,a1,a6, and then some other number in four rolls of a balanced die 


om 
IS S°S°S'S = FT 


2.6 THE RULE OF BAYES 


The general law of multiplication is useful in solving many problems 


in which the ultimate outcome of an experiment depends on the outcomes 
of various intermediate stages. 


Example. Suppose, for instance, thai an assembly plant receives its 
voltage regulators from two different suppliers, 75 percent from 
supplier B, and 25 percent from supplier B,. In other words, the 
probability that any one voltage regulator recevied by the plant comes 
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from supplier B, is } and the probability that it comes from supplier 
B, is 4. Suppose, furthermore, that 95 percent of the voltage regulators 
supplied by B, and 80 percent of those supplied by B, perform accord- 
ing to specifications. What we would like to know is the probability 
of the event A that any one voltage regulator received by the plant will 
perform according to specifications. 

If we use the fact that A = (A 1 B,) U(A MO B,) and that AN B, 
and A B, are mutually exclusive (see Figure 2.10 with B, and B, 
taking the place of B and B’), the special law of addition yields 


P(A) = P(A B,) + P(A 2 B,) 


Then, if we apply the general law of multiplication to P(A ™ B,) and 
P(A ~ B,), we obtain 


P(A) = P(B,):P(A|B) 7 P(B,)+ P(A |B) 


and substituting the given probabilities P(B,) = 3, P(B,) = 4, P(A| B1) 
= 0.95, and P(A|B,) = 0.80, we finally get 


P(A) = 3(0.95) + 4(0.80) = 0.9125 


for the desired probability that any one voltage regulator received by 
the plant will perform according to specifications. 


In the preceding example there were only two alternatives at the inter- 
mediate stage, B, and B,. In general, if there are n mutually exclusive 
alternatives B,, B,,..., B, at the intermediate stage, the corresponding 
formula for the probability of the final outcome, event A, is given by 





P(A) = 3) P(B,)-P(A|B) 





sometimes called the rule of elimination. To visualize this result, we have 
only to construct a tree diagram like that of Figure 2.14, where the prob- 
ability of the final outcome is given by the sum of the products of the 
probabilities corresponding to each individual branch. 


Continuation of Example. To illustrate, suppose that there had been 
three suppliers in our example and that the required probabilities are 
as shown in Figure 2.15. Then, the probability that any one voltage 


38 Ch. 2 Probability 





Figure 2.15. Tree diagram for example with three suppliers of voltage 
regulators. 


regulator received by the plant will perform according to specifications 
1S 

P(A) = (0.60)(0.95) + (0.30)(0.80) + (0.10)(0.65) 
0.875 


I 


To consider a problem which is closely related to the one we have 
just discussed, suppose we want to know the probability of the event 
that a particular voltage regulator which is known to perform according 


to specifications came from supplier B,. Symbolically, we want to know 
the value of P(B;| A). To solve this problem we write 


P(B,| A) = “SB 


and then substitute 


P(A ™ B;) = P(B;)-P(A|B3) 


39 2.6 The Rule of Bayes 


and 


P(A) = ¥) P(B)-P(A|B) 


respectively, into the numerator and the denominator in accordance 
with the general law of multiplication and the rule of elimination. 
We thus obtain the formula 
P(B, | A) _ ECB EAB) 
pe P(B;):P(A|B;) 


i=1 


which expresses the required probability in terms of given probabilities. 
Substituting the values from Figure 2.15, we finally obtain 


(0.10)(0.65) 
(0.60)(0.95) - (0.30)(0.80) + (0.10)(0.65) 


= 0.074 


P(B;|A) = 


Note that the probability that a voltage regulator is supplied by B; 
decreases from 0.10 to 0.074 once it is known that it performs accord- 
ing to specifications. 


The method used to solve the last example can easily be generalized to 
yield the following formula, called the rule of Bayes: 


THEOREM 2.8. Jf B,, B,,..., B, are.mutually exclusive events of 


which one must occur, that is, >, P(B;) = 1, then 
i=1 


P(B,)-P(A|B,) 


P(B,| A) = nh 
> P(B;): P(A|B;) 


FORT Shs 2. oy OF Te 





This rule provides a formula for finding the probability that the 
“effect” A was “caused” by the event B,. For instance, in our example 
we found the probability that an acceptable voltage regulator was made 
by supplier B;. The probabilities P(B;) are called the “prior” or “a priori” 
probabilities of the “causes” B,, and in practice it is often difficult to assign 
them numerical values. For many years Bayes’ rule was looked upon with 
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suspicion because it was used with the often-erroneous assumption that 
the prior probabilities are all equal. A good deal of the controversy once 
surrounding the rule of Bayes has been cleared up with the realization that 
the probabilities P(B;) must be determined separately in each case from 
the nature of the problem, preferably on the basis of past experience. We 
shall return to this problem, as it relates to Bayesian inference, in Chapters 
7 and 9, 


Example. To further illustrate the rule of Bayes, suppose that the four 
attendants at a gasoline service station are supposed to wash the 
windshield of each customer’s car. Jack, who services 20 percent of 
all cars, fails to wash the windshiels 1 time in 20; Tom, who services 
60 percent of all cars, fails to wash the windshield 1 time in 10; George, 
who services 15 percent of all cars, fails to wash the windshield 1 time 
in 10; and Peter, who services 5 percent of all cars, fails to wash the 
windshield 1 time in 20. If a customer complains later on that her 
windshield was not washed, the probability that her car was serviced 
by Jack may be obtained by substituting the corresponding probabilities 
into the formula for the rule of Bayes; thus, we get 


~ (0.20)(0.05) 
(0.20)(0 05) LO. (0.60 V0. 10) + (0.15)(0.10) + (0.05)(0.05) 


= 0.114 


P(B,| A) = 


and it is of interest to note that although Jack fails to wash the wind- 
shield of only 1 car in 20, namely, 5 percent, more than 11 percent of 
the windshields that are not washed at this service station are his 
responsibility. 


EXERCISES 


VA 

vA Referring to Figure 2.8, find P(U/|D) and P(/|D’); assume that originally 
each of the 500 machine parts has the same probability of being chosen for 
inspection. 


2. If in the example on page 34 the probability is 0.75 that the research project 
will be well executed, what is the probability that the project was well plan- 
ned given that it is well executed ? 


4 ‘Referring to Exercise 4 on page 28, Figure 2.11, and assuming equal prob- 
abilities, find the probability that the person whose questionnaire is chosen 
to be double-checked 
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(a) lives more than three miles from the center of the city given that he or 
she would gladly switch to public mass transportation; 

(b) regularly drives his or her car to work given that he or she lives more 
than three miles from the center of the city; 

(c) would not want to switch to public mass transportation given that he or 
she does not regularly drive his or her car to work. 


4. Referring to the probabilities of Figure 2.12, find 


(a) P(A|B); (b) P(B|C"); 

(c) P(A fy BC); (d) P(BU C\A’; 

(e) P(A|BUC); (f) P(AIBN C); 

(gy PAN BOAC|IBN C); (th) PAN BO C|BU C). 


. Referring to the lawn mower example and the probabilities given in Figure 
2.9, find | 
(a) P(E,|P,) and compare its value with that of P(E); 
(b) P(C,|P,) and compare its value with that of P(C2); 
(c) P(E,|P; © C;) and compare its value with that of P(£;). 


6. With reference to Exercise 21 on page 30, find 
(a) the probability that the airport will get the design award given that it 
will get the award for the efficient use of materials; 
(b) the probability that the airport will get the design award given that it will 
not get the award for the efficient use of materials. 


7. Prove that P(A|B) = P(A) implies P(B| A) = P(B) provided P(B) + 0. 


8. Among the 24 invoices prepared by a billing department, 4 contain errors 
while the others do not. If we randomly check two of these invoices, what 
are the probabilities that 
(a) both will contain errors; 

(by neither will contain an error? 


9 hat is the probability of drawing two black cards from a standard deck of 
52 playing cards if 
(a) the first card is replaced before the second card is drawn; 
(b) the first card is not replaced before the second card is drawn? 


10. The probability that Mr. Jones will be late to work on any one day is 0.60. 
Assuming that his being late on any one day is independent of his being late 
on any other day, find the probability that in a given week he will be late on 

nday, on time on Tuesday, Wednesday, and Thursday, and late again on 
riday. 
. Referring to the lawn mower example, the probabilities of Figure 2.9, and 
the results given on page 23, check whether P(E; M1 P, M C,) = P(E;): 
P(P,;)-P(C,). What can you conclude about the events F,, P,, and C;? 


12. Find the probabilities of getting 
(a) six tails in a row with a balanced coin; 
(b) five multiple-choice questions answered correctly, if for each question 
the probability of answering it correctly is }. 
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re The problem of determining the probability that any number of events will 


Pi 


/ 


occur becomes more complicated when the events are not independent. For 
three events A, B, and C, for example, the probability that they will all occur 
is obtained by multiplying the probability of A by the probability of B given 
A, and then multiplying the result by the probability of C given A \ B. For 
instance, the probability of drawing (without replacement) 3 aces in a row 
from an ordinary deck of 52 playing cards is 


ae ee 


— 


52° 51°50 ~«+5,525 


Clearly, there are only 3 aces among the 51 cards which remain after the first 
ace has been drawn, and only 2 aces among the 50 cards which remain after 
the first two aces have been drawn. 
(a) If six bullets, of which three are blanks, are randomly inserted into a gun, 
what is the probability that the first three bullets fired will all be blanks? 
(b) In a certain city during the month of May, the probability that a rainy 
day will be followed by another rainy day is 0.80 and the probability 
that a sunny day will be followed by a rainy day is 0.60. Assuming that 
each day is classified as being either rainy or sunny and that the weather 
on any given day depends only on the weather the day before, find the 
probability that in the given city a rainy day in May is followed by two 
more rainy days, then a sunny day, and finally another rainy day. 
(c) A department store which bills its charge-account customers once a 
month has found that if a customer pays promptly one month, the prob- 
ability is 0.90 that he will also pay promptly the next month; however, if 
a customer does not pay promptly one month, the probability that he 
will pay promptly the next month is only 0.50. What is the probability 
that a customer who has paid promptly one month will not pay promptly 
the next three months? 


AA. The supervisor of an electronics plant knows from past experience that a new 


15. 


worker who has attended the company’s training program has a probability 
of 0.85 of meeting his production quota, while a new worker who has not 
attended the training program has a probability of only 0.33. If 75 percent of 
all new workers attend the training program, what is the probability that a 
new worker will meet his production quota? 


Use the result of Exercise 14 to find 

(a) the probability that a new worker who meets his production quota will 
have attended the training program; 

(b) the probability that a new worker who meets his production quota will 
not have attended the training program. 


~ Members of a consulting firm rent cars from three rental agencies: 20 per- 


cent from agency D, 20 percent from agency E, and 60 percent from agency 
F. If 10 percent of the cars from rental agency D have bad tires, while the 
corresponding percentages for agencies E and F are, respectively, 12 and 4, 


Se, 
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what is the probability that one of the consultants will get a rental car with 
bad tires? 


7. Use the result of Exercise 16 to find the probability that a car with bad tires 
issued to one of the consultants came from rental agency F. 
18. There is a fifty-fifty chance that Firm A will bid for the construction of a new 
city hall. Firm B submits a bid and the probability that it will get the job is 
2 provided Firm A does not bid; if Firm A submits a bid, the probability 
that Firm B will get the job is only 4. If Firm B gets the job, what is the prob- 
ability that Firm A did not bid? 


19. With reference to the example on page 36, find the respective probabilities 
that a voltage regulator which performs according to specifications came 
(a) from supplier B, ; 
(b) from supplier B,. 

20. With reference to the example on page 40, find the probability that the car 
of the complaining customer was serviced by Tom. 


21. An explosion in an LNG storage tank in the process of being repaired could 
have occurred as the result of static electricity, malfunctioning electrical 
equipment, an open flame in contact with the liner, or purposeful action 
(industrial sabotage). Interviews with engineers who were analyzing the risks 
involved led to estimates that such an explosion would occur with probabil- 
ity 0.25 as a result of static electricity, 0.20 as a result of malfunctioning 
electric equipment, 0.40 as a result of an open flame, and 0.75 as a result of 
purposeful action. These interviews also yielded subjective estimates of the 
prior probabilities of these four causes of 0.30, 0.40, 0.15, and 0.15, respec- 
tively. What was the most likely cause of the explosion ? 


2./ MATHEMATICAL EXPECTATION 
AND DECISION MAKING 


If an insurance agent tells us that in the United States a 40-year-old 
woman can expect to live 38 more years, this does not mean that anyone 
really expects a 40-year-old woman to live until her 78th birthday and 
then pass away the next day. Similarly, if we read that a radial tire can 
be expected to last 40,000 miles, that in the coal mining industry we can 
expect 41.5 nonfatal injuries per million man-hours, or that the U.S. 
Patent Office can expect to issue 27,000 certificates of trademarks each 
year, it must be understood that the word “expect” is not used in its 
colloquial sense. So far as the first statement is concerned, some 40-year- 
old women will live another 15 years, some will live another 30 years, 
some will live another 50 years, ..., and the life expectance of “38 more 
years” will have to be interpreted as an average, namely, as a mathematical 
expectation. 
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Originally, the concept of a mathematical expectation arose in con- 
nection with games of chance, and in its simplest form it is the product 
of the probability that a player will win and the amount he stands to win. 


Example. If we stand to receive $10 if a balanced coin comes up fails 
and nothing if it comes up heads, our mathematical expectation is 
10-4 = $5. Similarly, if we buy one of 1,000 raffle tickets issued for a 


and our 





grand prize of $500, the probability for each ticket is i ua 
1 2 


00) = $0.50. Note that 999 of 
the tickets will not pay anything at all, and only one of the tickets will 
pay $500. However, all the 1,000 tickets together pay $500, or on the 


average $0.50 per ticket—this is the mathematical expectation for each 
ticket. 


mathematical expectation is 500 ( 


In this example there was only a single “payoff,” namely, a single prize 
or a single payment. To see how the concept of a mathematical expecta- 
tion can be generalized, let us consider the following example: 


Example. A distributor makes a profit of $20 on an item if it is shipped 
from the factory in perfect condition, but if it is not shipped in perfect 
condition, the profit is reduced to $12 because of the cost of repairs. 
This information in itself does not enable us to determine the dis- 
tributor’s average profit per item, so let us suppose that only 80 percent 
of the items are shipped from the factory in perfect condition. Then 
we can argue that for items shipped from the factory he should make 
a profit of $20 about 80 percent of the time, a profit of $12 about 20 
percent of the time, and on the average he should, thus, make a profit of 


20(0.80) + 12(0.20) = $18.40 


This result is the sum of the products obtained by multiplying each amount 
by the corresponding proportion or probability. Generalizing from this 
example, let us now make the following definition: 





If the probabilities of obtaining the amounts a,, A,,..., OF A, are, 
respectively, D,,p2,..., and p,, then the mathematical expectation 
1S 
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So far as the a’s are concerned, it is important to keep in mind that they 
are positive when they represent profits, winnings, or gains, and that they 
are negative when they represent losses, deficits, or penalties. 


Example. To illustrate this definition, suppose that an engineering 
firm is faced with the task of preparing a proposal for a research con- 
tract; the cost of preparing the proposal is $5,000, and it is assumed 
to be known that the probabilities for potential gross profits of 
$50,000, $30,000, $10,000, or $0 are, respectively, 0.20, 0.50, 0.20, 
and 0.10, provided the proposal is accepted. Assessing the probability 
that the proposal will be accepted as 0.30, we find that there is a 
probability of (0.30)(0.20) = 0.06 of making a net profit of $45,000 
($50,000 minus the cost of the proposal). Similarly, the probabilities 
of making net profits of $25,000 or $5,000 are, respectively, (0.50)(0.30) 
= 0.15 and (0.20)(0.30) = 0.06, while the probability of a $5,000 
loss is (0.10)(0.30) + 0.70 = 0.73, allowing for the eventuality that 
the proposal will not be accepted. Thus, we find that the expected net 


profit is 
45,000(0.06) + 25,000(0.15) + 5,000(0.06) — 5,000(0.73) = $3,100 


Whether or not it is wise to risk $5,000 to make an expected profit 
of $3,100 is not a question of statistics. If a company has little capital 
and can make only few such proposals, the 0.73 chance of losing 
$5,000 may well make the venture unattractive. On the other hand, if 
a company is well capitalized and prepares many such proposals, 
the risk may be worth taking, for it promises an expected return of 
62 percent on the original investment. 


Example. The following is a typical inventory problem in which 
expected values are used to determine optimum conditions. Suppose 
it is known from past experience that the daily demand for a perishable 
product is as shown in the following table: 


Number of orders 





Probability 0.05 0.12 0.20 0.24 0.17 0.14 0.08 


Each item costs $35 (including the cost of carrying it in stock), it is 
sold for $50 provided it is in stock, and an item remaining in stock 
at the end of a day represents a total loss. On the basis of this informa- 
tion we want to determine how many items should be stocked each 
day so as to maximize the expected profit. 
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For 3 units stocked, the profit in dollars is obviously 150 — 105 = 
45, as there is a probability of 1.00 that there will be a demand for 3 
units or more. For 4 units stocked, there is a probability of 0.05 
that exactly 3 units can be sold, a probability of 0.95 that there will 
be a demand for 4 or more, and there is, therefore, an expected dollar 
profit of 


150(0.05) + 200(0.95) — 140 = 57.50 


Similarly, for 5 units stocked there is a probability of 0.05 that exactly 
3 units will be sold, a probability of 0.12 that exactly 4 units will be 
sold, a probability of 0.83 that there will be a demand for 5 or more, 
and there is, therefore, an expected dollar profit of 


150(0.05) + 200(0.12) -+ 250(0.83) — 175 = 64.00 


Continuing in this manner it can be shown that for 6 units stocked the 

expected profit is $60.50, for 7 units stocked the expected profit is 

$45.00, etc., and that the expected profit is a maximum when 5 units 

are stocked (see also Exercise 6 on page 47). 

In recent years, mathematical expectations have been playing an 
increasingly important role in “scientific” decision making, as it is gen- 
erally considered rational to select whichever alternative has the “most 
promising” mathematical expectation: the one which maximizes expected 
profits, minimizes expected costs, maximizes tax advantages, minimizes 
expected losses, and so forth. Although this approach to decision making 
has intuitive appeal and seems very logical, it is not without complications. 

In applying the methods of decision making illustrated in this section, 
there are essentially two practical limitations: first, we must be able to 
assign “cash values” to the various outcomes, and second, we must have 
adequate information concerning all relevant probabilities. The problem 
of assigning cash values to the various outcomes can be much more dif- 
ficult than it was in our examples. For instance, if a government agency 
were faced with the decision of whether to require crash barriers for inter- 
state highway construction, it would be extremely difficult to assign a cash 
value to the savings in life and reductions in injuries that might result. 

The specification of probabilities concerning the various outcomes can 
be equally difficult. In our examples, the probabilities were presumably 
based on past experience, and, hence, are really only estimates (or approxi- 
mations). This may be questioned with respect to the last example (the 
one dealing with the preparation of the research proposal), where it 
would be extremely difficult even to estimate the probability that a given 
proposal will be accepted. This would have to depend not only on the 
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quality of the proposal itself, but on such unknown factors as the number 
and quality of competing proposals as well as the policy of the sponsoring 
agency. The estimate of 0.30 in our example was purely subjective, being 
based on some executive’s “feeling” about the situation measured against 
the background of his personal experience. While such estimates of 
probabilities are not without value, they do leave open the possibility that 
different individuals (acting in the same situation and having the same 
information) would arrive at different “optimum” decisions. 


EXERCISES 


A. charitable organization raises funds by selling 4,000 raffle tickets for a 
vacation trip worth $560, what is the mathematical expectation of a person 
who buys one of these tickets ? What would be the mathematical expectation 
if there were also a second prize of a painting worth $320? 


2. When the American League and National League champions are evenly 
matched, the probabilities that a World Series will end in 4, 5, 6, or 7 games 
are, respectively, 4, 4, ~:, and ~;. What is the expected length of a World 

wees when the two teams are evenly matched? 


8. It costs $60 to test a certain component of a machine. If a defective compo- 
nent is installed, it costs $1,200 to repair the resulting damage to the machine. 
Is it more profitable to install the component without testing if it is known 
that 
(a) 3 percent of all components produced are defective; 
(b) 5 percent of all components produced are defective; 
(c) 8 percent of all components produced are defective? 


4. The two finalists in a golf tournament play 18 holes, with the winner getting 
$20,000 and the runner-up getting $12,000. What are the two players’ mathe- 
matical expectations if KME,s # (20, oa )4 aS ((@_ 080 ) 
(a), they are evenly matched; 

one of the two players should be favored by odds of 3 to 1? 


. If someone were to give us $6 each time we roll a 1 with a balanced die, how 
much should we have to pay him when we roll a 2, 3, 4, 5, or 6 to make the 
game equitable? 


6. Referring to the example on page 45, verify that for 6 and 7 units stocked, 
Pn expected profits are $60.50 and $45.00, respectively. 


. To handle a liability suit, a lawyer has to decide whether to charge a straight 
fee of $600 or a contingent fee of $1,800 which he will get only if his client 
wins. What does the lawyer think about his client’s chances if he feels that 
(a) taking the straight fee will give him a higher mathematical expectation; 


< 
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(b) taking the contingent fee will give him a higher mathematical expecta- 
* tion? 


| A. The manufacturer of a new battery additive has to decide whether to sell his 
’ product for $0.80 a can, or for $1.20 with a “double-your-money-back-if- 
not-satisfied” guaranty. How does he feel about the chances that a person 
will ask for double his money back if 
(a) he decides to sell the product for $0.80; 
(b) he decides to sell the product for $1.20 with the guaranty; 
(c) he cannot make up his mind? 


9, A contractor has to choose between two jobs. The first job promises a profit 
of $240,000 with a probability of 0.75 or a loss of $60,000 (due to strikes and 
other delays) with a probability of 0.25; the second job promises a profit of 
$360,000 with a probability of 0.50 or a loss of $90,000 with a probability of 
0.50. 

(a) Which job should the contractor choose if he wants to maximize his 
expected profit? 
(b) Which job would the contractor probably choose if his business is in bad 
shape and he will go broke unless he can make a profit of at least 
$300,000 on his next job? 


. A merchant can buy an item for $2.10 and sell it for $4.50. The probabilities 
for a demand of 0, 1, 2, 3, 4, or “5 or more” items are, respectively, 0.05, 
0.15, 0.30, 0.25, 0.15, and 0.10. Calculate the expected profits resulting from 
stocking 0, 1, 2, 3, 4, or 5 items and determine how many the merchant 
should stock so as to maximize his expected profit. 







11. Mr. Brown and Mr. Jones are betting on repeated flips of a balanced coin. 
At the beginning Mr. Brown has m dollars, Mr. Jones has n dollars; at each 
flip the loser pays the winner one dollar, and they continue playing until one 
of them has lost all the money with which he began. To find p, the prob- 
ability that Mr. Brown will win Mr. Jones’ 1 dollars before he loses his m 
dollars, set up an equation in m, n, and p which makes use of the fact that in 
an equitable game each player’s mathematical expectation is zero. What is 
the probability that Mr. Brown will win if he begins with $12 and Mr. Jones 
begins with $4? 


. The management of a mining company must decide whether to continue an 
operation at a certain location. If they continue and are successful, this will 
be worth $1,000,000; if they continue and are not successful, this will entail 
a loss of $600,000; if they do not continue but would have been successful, 
this will entail a loss of $400,000 (for competitive reasons); and if they do 
not continue and would not have been successful anyhow, this will be worth 
$100,000 to the company (because funds allocated to the operation remain 
unspent). 

(a) What decision would maximize the company’s expected gain if the odds 
against success are 3 to 2? 
(b) Repeat part (a) with 4 to 1 odds against success. 
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13. 


(c) If the odds of part (a) are correct and it were possible to continue the 
operation for two months at a cost of $300,000, after which it would be 
known for certain whether the operation will be a success, would it be 
worthwhile to spend these additional funds? 

(d) What could one expect the management of the mining company to 
decide if they did not know anything about the chances of success, but 
in general they are confirmed pessimists ? 


A power company is faced with the choice of whether to construct a light- 
water reactor (LWR) or a fossil-fuel power plant (FF). The LWR plant will 
cost $300 per kilowatt to construct, and the FF plant will cost $150 per 
kilowatt. Due to uncertainties concerning fuel availability and the impact of 
future air- and water-quality regulations, the useful operating life of each 
plant is unknown, but the following probabilities have been estimated: 


Useful life (years) 10 20 30 #£40 


Probabilities ee plant 0.05 0.25 0.50 0.20 
FF plant 0.10 0.50 0.30 0.10 

(a) Which plant can be expected to cost less per year of useful life? 

(b) What relative fuel cost per kilowatt would favor construction of the 
LWR plant? 

(c) A further uncertainty concerning safety regulations makes it a 50-50 
bet that $50 per kilowatt will be added to the actual construction cost of 
the LWR plant. What effect does this have on these results ? 
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-B 1 RANDOM VARIABLES 


In most experiments we are interested only in a particular numerical 
description (or, perhaps, two or three particular descriptions) of the 
outcomes. For instance, in the inspection of manufactured products we 
may be interested only in the total number of defective items and not in 
the nature of the defects; in the study of an alloy we may want to know 
the percent of chromium present, without caring about the other elements; 
in the analysis of a road test we may be interested only in the average 
speed and the average fuel consumption, and not in any other perfor- 
mance characteristics; and in the study of the performance of a rotary 
stepping switch we may be interested in its lubrication, the current, and 
the humidity, but not in any other factors. 


50 
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Example. To illustrate this idea more specifically, let us refer again 
to the rating of lawn mowers in the example on page 8 and the 
corresponding probabilities shown in Figure 2.9. Now let us suppose 
that we refer to E, (easy to operate), P, (inexpensive), and C; (low 
average cost of repairs) as preferred ratings, and that we are inter- 
ested only in the number of preferred ratings which a lawn mower will 
get. To find the probabilities that a lawn mower will get 0, 1, 2, or 3 
preferred ratings, let us refer to Figure 3.1, which is like Figure 2.9, 
except that we indicated for each outcome the corresponding number 
of preferred ratings. Adding the respective probabilities, we thus find 





Figure 3.1. Ratings of lawn mowers and number of preferred ratings. 
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that the probability of a lawn mower getting zero preferred ratings 1s 
P(O) = 0.07 + 0.14 + 0.02 + 0.03 = 0.26 


Similarly, letting x represent the number of preferred ratings, we obtain 
the values shown in the following table. 


Value of x 0 1 2 3 


Probability 0.26 0.50 0.22 0.02 


The numbers 0, 1, 2, and 3 in this table are values of the random variable 
describing the number of preferred ratings. Observe that to each outcome 
in the sample space there corresponds one and only one value x of this 
random variable, so that a random variable may be thought of as a func- 
tion defined over the elements of a sample space. Note further that to find 
the probability that a random variable will take on any one value within 
its range, we have only to refer to the probabilities assigned to the indi- 
vidual outcomes and the extension of Axiom 3 given on page 18.* We 
thus define another function, which associates with each value of a random 
variable the probability that this value will be assumed. Such a function, 
an example of which is exhibited by the above table, is called a probability 
distribution function, or simply a probability distribution. To denote the 
values of a probability distribution we shall use such symbols as f (2); 
P(x), 6(z), gQ’), and so on. 

Whenever possible, we try to define probability distributions by means 
of appropriate equations; otherwise we must give a table which actually 
exhibits the correspondence between the values assumed by the random 
variable and the associated probabilities. 


Example. For instance, 
fx~=4 for x = 1, 2, 3, 4, 3,6 


gives the probability distribution for the number of points we roll 
with a balanced die. 


Of course, not every function defined for the values of a random 
variable can serve as a probability distribution. Since the values of a 
probability distribution are probabilities and since one value of a random 
variable must always occur, it follows that if f(x) is a value of a probability 


*It is assumed here that the sample space is finite; the changes that are required when 
a sample space is countably infinite or continuous will be discussed on pages 78 and 98. 
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distribution, then 






f(x) >0 for all x 





i f(x) = 1 


allx 





Example. For instance 


fe =*+ tory 1,9, 3,4 





cannot serve as a probability distribution because f(1) is negative, and 
2" at 
h(x) = 5 for x = 0, I, 2, 3, 4 


cannot serve as a probability distribution because the sum of the five 
probabilities is 2 instead of 1. 


It is often helpful to visualize probability distributions by means of 
graphs like those of Figure 3.2. The one on the left is called a histogram; 





Figure 3.2. Graphs of the probability distribution of the number of 
preferred ratings. 


54 Ch.3 Probability Distributions 


the heights of the rectangles are proportional to the corresponding 
probabilities and their bases touch so that there are no gaps between the 
rectangles representing the successive values of the random variable. The 
one on the right is called a bar chart; the heights of the rectangles are also 
proportional to the corresponding probabilities, but they are narrow and 
their width is of no significance. 

As we shall see later in this chapter, there are many problems in which 
we are interested not only in the probability f(x) that the value of a ran- 
dom variable is x, but also in the probability that the value of a random 
variable is /ess than or equal to x. Writing this probability as F(x), we refer 
to the function which assigns a value F(x) to each x within the range of 
the random variable as the cumulative distribution function, or simply as the 
distribution function. 


Example. Referring again to the lawn-mower-rating example and 
basing our calculations on the table on page 52, we thus get 


Value of x 0 1 2 3 
F(x) 0.26 0.76 0.98 1.00 


for the distribution function of the number of preferred ratings. 


Fa THE BINOMIAL DISTRIBUTION 


Many statistical problems deal with situations referred to as “repeated 
trials.” For example, if we want to know the probability that 1 of 5 rivets 
will rupture in a tensile test, that 9 of 10 vacuum tubes will last at least 
1,000 hours, that at least 60 of a shipment of 75 units are effective, we 
are in each case concerned with a number of “trials” and we are interested 
in the probability of getting a certain number of “successes.” 

Letting n represent the number of trials and x the number of successes, 
problems like these can be solved using the so-called binomial distribution, 
provided that they satisfy the following assumptions: 


1. There are only two possible outcomes for each trial, arbitrarily 
called “success” and “failure” without inferring that a success is 
necessarily a desirable outcome.* 


*Thus, getting a defective unit in sampling inspection may be called a “success”’; this 
terminology carries over from the days when probability theory was applied only to 
games of chance. 
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2. The probability of a success is the same for each trial. 
3. There are n trials, where n is a constant. 
4. The 7 trials are independent. 


Trials satisfying these assumptions are also referred to as Bernoulli trials; 
of course, if the assumptions cannot be met, the theory which we shall 
develop will not apply. 

First, let us find the probability of getting x successes in n trials in a 
given order, when the probability of a success is p and, hence, the prob- 
ability of a failure is 1 — p. Denoting success and failure, respectively, 
by the letters S and F, consider the sequence 


S ©. wie © Ff gat £ 
Son pee’ ‘eens pee 


x n—-xX 


consisting of x successes, followed by n — x failures. By the assumption of 
independence, the probability of this sequence is the product of the prob- 
abilities of the individual outcomes; there are x factors p, n — x factors 
1 — p, and, hence, the probability is p*(1 — p)"-*. Note that the identical 
result would have been obtained for any other sequence containing x 
successes and mn — x failures in a given order. There would also have been 
x factors p and n — x factors 1 — p, and the desired probability would 
also have been given by their product. 

To obtain the probability of x successes and n — x failures in any order, 
we must now add the probabilities of all sequences containing x successes 
and m — x failures in some order. Since each of these sequences. has the 
probability p*(1 — p)""*, we have only to find the number of such se- 
quences and multiply this number by p*(1 — p)"~*. Note that p*(1 — p)"-* 
is, thus, multiplied by the number of ways in which the x letters S can be 
distributed among the n trials. To find this number, suppose that we 
temporarily assume that the S’s are distinguishable, say, by giving them 
subscripts and writing them as S,, S,,..., S,. Beginning with S,, we can 
assign it to any one of the 7 trials, that is, we can assign it in m different 
ways. After that we can assign S, in nm — 1 different ways, S; in nm — 2 
ways, ..., and, finally, S, inn —x-+ 1 different ways. According to 
Theorem 2.1 on page 9, all of these S’s can, thus, be distributed among 
the v trials in n(n — 1)(n — 2)-...+(n — x + 1) different ways. If we 
now omit the subscripts, we find that the various arrangements which we 
have considered are not all distinct. For instance, S,FS,S;F, S,FS;S,F, 
S,FS,S;,F, S,FS,S,F, S;FS,S,F, and S;FS,S,F. all lead to SFSSF. In 
general, the x letters S with subscripts can be arranged among themselves 
in x(x — 1)-...+2+1 different ways according to Theorem 2.1, and, hence, 
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this many arrangements of the S’s with subscripts lead to a single arrange- 
ment once the subscripts are dropped. The number of ways in which the 
x letters S can be distributed among the 7 trials is, thus, given by 


n(n — 1)(n — 2)-...°*(n —x + 1) 
x(x — 1)... °21 


which in the factorial notation can be written as n!/x!(n — x)!.* It follows 
that under the assumptions stated on pages 54 and 55 the probability of 
x successes in 7 trials is given by 


b(x3n, p) = = orf — p)"* for X= Oy Lees xg tt 


x'(n — 


The probability distribution defined by this equation is called the binomial 
distribution. In fact, this equation defines a family of probability distribu- 
tions, with each member of this family being characterized by given values 
of the parameters n and p. 


Example. To give an example where the binomial distribution can be 
applied, suppose that a safety engineer claims that only 60 percent of 
all drivers whose cars are equipped with seat belts use them on short 
trips and that we want to determine the probability that, under these 
conditions, 4 of 5 drivers will be using their seat belts. Substituting 
n= 5, p = 0.60, and x = 4 into the formula for the binomial dis- 
tribution, we get 


b(4; 5, 0.60) = ( 4 )(0.60)*(1 — 0.60)*-* 
= 0.259 


and if we wanted to determine also the probability that at least 4 of 5 
drivers will be using their seat belts, we would get 


b(4; 5, 0.60) + b(5; 5, 0.60) = 0.259 + 0.078 = 0.337 


n! 
x'(n — x)! 
coefficient of a* in the binomial expansion of (a + 6)"; it is often represented by the 


symbol @ 


*The expression is often referred to as a binomial coefficient since it is the 
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The probability that fewer than 4 of 5 drivers will be wearing their 
seat belts is 1 — 0.337 = 0.663, and the corresponding odds are just 
about 2 to 1. 


If nis large, the calculation of binomial probabilities can become quite 
tedious; in that case it may be desirable to use numerical approximations 
or refer to special tables. Values of the binomial distribution and the 
corresponding cumulative distribution function have been tabulated for 

= 2 ton = 49 by the National Bureau of Standards and for n = 50 to 
n = 100 by H. G. Romig (see the Bibliography at the end of the book). 
Table 1 at the end of this book gives the values of 


B(x; n, p) =) b(k; n, ) for x = 0,1, 2..4+48 


forn = 2ton = 20and p = 0.05, 0.10, 0.15, 0.20,..., 0.50. We tabulated 
the cumulative probabilities rather than the values of b(x; n, p), because 
the values of B(x; n, p) are the ones needed more often in statistical 
applications. Note, however, that the values of b(x; n, p) can be obtained 
with the use of Table 1 by making use of the identities in parts (c) and (d) 
of Theorem 3.1 below. Also note that values of B(x; n, p) are not given in 
Table 1 for p greater than 0.50, but they can be obtained from this table 
with the use of the identity given in part (b) of Theorem 3.1. 


THEOREM 3,1 

(a) d(x; n, p) = b(n — x3 n, 1 — p) 

(b) B(x; n, p) = 1— Ba —x—13;n,1-—p) 

(c) d(x; n, p) = Bx; n, p) — B(x — 134, p) 

(d) b(x; , p) = Bn—x3;n, 1 — p)-— Bun—x—13n,1—p) 


The first of these identities can be verified by substituting on both sides 
the corresponding expressions given by the equation defining the binomial 
distribution; the second identity follows from (a) and the definition of a 
cumulative distribution; the third identity follows directly from the 
definition of a cumulative distribution; and the fourth identity follows 
by substitution from parts (a) and (c). The reader will be asked to verify 
these identities in detail in Exercises 5 and 6 on pages 63 and 64. 


Example. To illustrate the use of Table 1 and identities (a) and (c) 
of Theorem 3.1, let us suppose that in a very large shipment of inte- 
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grated-circuit chips, the probability of failure for any one chip is 0.20. 
Assuming, furthermore, that the assumptions underlying the binomial 
distribution are approximately met, the probability that at most 3 
chips fail in a sample of 20 is given by 


B(3; 20, 0.20) = 0.4114 


Also, the probability that 5 or more chips fail in a sample of 16 is given 
by 


3 b(x; 16, 0.20) = 1 — B(4; 16, 0.20) = 1 — 0.7982 = 0.2018 
x=5 


the probability that exactly 12 chips do not fail in a sample of 15 is 
b(12; 15, 0.80) = b(3; 15, 0.20) = 0.2502 
and the probability that exactly 5 chips fail in a sample of 18 is given by 


b(5; 18, 0.20) = B(5; 18, 0.20) — B(4; 18, 0.20) 
— 0.8671 — 0.7164 = 0.1507 


To illustrate the use of identities (b) and (d) of Theorem 3.1, let 
us suppose that somebody claims that 75 percent of all industrial 
accidents could be prevented by paying strict attention to safety 
regulations. Then, if the claim is true, the probability that fewer than 
16 of 20 industrial accidents can be prevented by paying strict attention 
to safety regulations is given by 


B(15; 20, 0.75) = 1 — B(4; 20, 0.25) 


= | — 0.4148 
= 0.5852 


and the probability that exactly 12 of 15 industrial accidents can be 
prevented by paying strict attention to safety regulations is given by 
b(12: 15, 0.75) = BGs 15, 0.25) — BE; 15, 0.25) 
= 0.4613 — 0.2361 
=, 2252 


Example. To illustrate the use of the binomial distribution in a 
problem of decision making, suppose that a washing machine manu- 
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facturer claims that at most 10 percent of his machines require repairs 
within the warranty period of twelve months, and that the decision 
whether to accept or reject his claim has to be made on the basis of 
a sample of 20 of his machines, of which 5 required repairs within the 
first year. To justify the claim one way or the other, let us first deter- 
mine the probability that 5 or more of 20 of these washing machines 
will require repairs within a year when the probability that any one of 
them will require repairs within a year is 0.10. Using Table 1, we get 


2 
S" B(x; 20, 0.10) = 1 — B(4; 20, 0.10) = 0.0432 
x=5 


and since this is very small (the odds against it are about 22 to 1), it 
would seem reasonable to reject the washing machine manufacturer’s 
claim. Note that the probability would have been even smaller if we 
had used a value of p less than 0.10. (See also Exercise 10 on page 64.) 


Interesting information about the shape of (the graph of) binomial 
distributions is illustrated in Figures 3.3 and 3.4. First, if p = 0.50 the 
equation for the binomial distribution becomes 


b(x;n, 0.50) = a -(0.50)" 


( 


and it should be noted that b(x; n, 0.50) = b(n — x; n, 0.50). This means 
that the histogram of a binomial distribution with p = 0.50 is symmetrical, 





Figure 3.3. Symmetrical binomial distribution with nm = 5 and p = 0.50. 
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Figure 3.4. Skewed binomial distributions with n = 5 and p = 0.20 and 
p = 0.80, respectively. 


as is illustrated in Figure 3.3. Now, note that if p is less than 0.50, it is 
more likely that x will be small rather than large compared to n, and that 
the opposite is true if p is greater than 0.50. This is illustrated in Figure 3.4, 
containing the histograms of binomial distributions with nm = 5 and 
p = 0.20 and p = 0.80, respectively. A probability distribution which 
has a histogram like those of Figure 3.4 is said to be skewed; it is said to 
be positively skewed (or to have positive skewness) if the longer “tail” is 
on the right, and it is said to be negatively skewed (or to have negative 
skewness) if the longer tail is on the left. 


.3. THE HYPERGEOMETRIC DISTRIBUTION 


Suppose we are interested in the number of defectives in a sample of 
n units drawn from a lot containing N units, of which a are defective. 
If the sample is drawn in such a way that at each successive drawing what- 
ever units are left in the lot have the same chance of being selected, the 


probability that the first: drawing will yield a defective unit is + but for 


the second drawing it is ga or Wo depending on whether or not 
the first unit drawn was defective. Thus, the trials are not independent, 
the fourth assumption underlying the binomial distribution is not met, 


and the binomial distribution does not apply. Note that the binomial 
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distribution would apply if we sampled with replacement, namely, if each 
unit selected for the sample were replaced before the next one is drawn. 

To solve the problem of sampling without replacement (that is, as we 
originally formulated the problem), let us refer to the result obtained 
on page 56, that a subset of x objects can be selected from a set of n 


objects in 
ee n! 
x} x(n — x)! 


ways. We proved this in connection with the problem of the distribution of 
xX successes among n trials, that is, choosing subsets on which the x suc- 


n ‘ ‘ ‘ ‘ 
cesses are to occur. We referred to (*) as a binomial coefficient, and it 


is also called the “number of combinations of 1 objects taken x at a time.” 
Returning now to the problem of finding the probability of getting 
x defectives in a sample of n units taken without replacement, note first 


that the sample space for this experiment has i) possible outcomes, 


namely, the number of ways in which a subset of m objects can be selected 
from among a set of N objects. Furthermore, the x defectives can be 


selected from the a defectives in ( : ways, the m — x nondefective units in 
the sample can be selected from the N — a nondefective units in the lot 


in eee ways, and according to Theorem 2.1, the whole sample can be 


selected in & (728) ways. Assuming that each of the a samples 


has the same probability of being selected, which is equivalent to the 
assumption stated in the first paragraph of this section, the required prob- 
ability for x “successes” in n trials without replacement is 





by virtue of Theorem 2.3. A probability distribution having this equation 
is called a hypergeometric distribution; the parameters of this family of 
distributions are the sample size n, the lot size (or population size) N, and 
the number of “successes” in the lot a, which in our special example was 
the total number of defectives in the lot (or population). 
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Example. To illustrate the use of the hypergeometric distribution, 
suppose that a shipment of 20 tape recorders contains 5 that are defec- 
tive, and that we want to determine the probability that among 10 of 
these tape recorders chosen for inspection only 2 will be defective. 
Substituting x = 2, n = 10, a= 5, and N = 20 into the formula for 
the hypergeometric distribution, we get 


(3)(8) 
2)/\ 8 : 
H(2s 10, 5, 20) = SPB _ BS 


(10) 
10 

and it stands to reason that the probability should be fairly high—half 
the tape recorders contain just slightly less than half the defectives. 
Probabilities like this are usually calculated using a table of binomial 
coefficients or a table of the logarithms of factorials, which can be 
found in most handbooks of mathematical tables. Note that if we had 
made the mistake of using the binomial distribution with n = 10 and 
Pp = 35 = 9.25 to calculate the probability of two defectives, the result 
would have been 0.282, which is much too small. 


= 0.348 


In this example, the sample size » = 10 was large compared to the 
lot size N = 20, and the effect of not replacing the units was rather severe. 
However, had the sample size been small compared to the lot size, the 
composition of the lot would not have been affected very much by drawing 
the sample, and it would have been reasonable to expect that the binomial 

a 


N should yield a very close approximation. 


distribution with p = 


Continuation of Example. To illustrate this point, suppose that the 
lot contained 100 units, rather than 20, and that 25 of these units, 
rather than 5, are defective. The probability of drawing a sample of 
size 10 with 2 defectives would then be 


(25)(75) 
RQ: 10; 25; 100) = S47" 7 219,902 


Go) 

10 

so that the approximate value of 0.282 given by the binomial distribu- 
tion with n = 10 and p = 25, = 0.25 would have been much closer. 


Actually, in many practical situations the sample size is small compared to 
the lot size (that is, the size of the set from which the sample is drawn), and 
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the binomial distribution very often provides a good approximation to the 
hypergeometric distribution. In fact, it can be shown formally that 


h(x; n, a, N) approaches b(x; n, p) with p = v when N — co while 


p= WV remains fixed. A good rule of thumb is to use the binomial distribu- 
tion as an approximation to the hypergeometric distribution if n< * 


Although we have introduced the hypergeometric distribution in 
connection with a problem of sampling inspection, it has many other 
applications. For instance, it can be used to find the probability that 3 
out of 12 housewives prefer Brand A detergent to Brand B, if they are 
selected from among 200 housewives among whom 40 actually prefer 
Brand A to Brand B. Also, it can be used in connection with a problem of 
selecting industrial diamoniJs, some of which have superior qualities and 
some do not, in connection with a problem of sampling income tax returns, 
where a among N returns filed contain questionable deductions, and so on. 


EXERCISES 


f 


, that a probability of 0.05 is assigned to each point of the sample 
space of Figure 2.4 on page 10. Find the probability distribution of the 
total number of lasers that are suitable for the given task. 


2. An experiment consists of four tosses of a coin. Denoting the outcomes 
HHTH, THTT, ..., and assuming that all 16 outcomes are equally likely, 
Pe the probability distribution for the total number of heads. 


: Check whether the following can define probability functions, and explain 
your answers: 
(a) f(x) =4 for x = 0, 1, 2: 
(b) f(x) = Fz for x = 0, 1, 2, 3, 4, 5; 


S7ays2 = for 0; 4, 2.3: 
(d) f(x) = } for x = 3, 4, 5, 6; 


(ey fea = xe tore] 1,0,3.4.5 








4. Giyen that f(x) = k(4)* is a probability distribution for a random variable 
ich can take on the values x = 0, 1, 2, 3, 4, 5, 6, find & and also find an 
xpression for the corresponding cumulative probabilities F(x). | 






Ay. Prove parts (a) and (b) of Theorem 3.1 on page 57. 
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ia parts (c) and (d) of Theorem 3.1 on page 57. 
7. Prove that 
Ax + 1jn,p)_ pl — x) 
B(x; Nn, P) (lo py +e 1) 


for x = §, 1. 2. — 1, and use this recursion formula to calculate the 
ues of the cifiomial distrpution with n = 6 and p = 0.30. Verify the 
sults by means of Table 1. 


. Using Table 1 find 






(a) B(8: 16, 0.40): (b) b(8; 16, 0.40): 

(c) BO; 12, 0.60); (d) 5(9; 12, 0.60); 

(e) > b(k; 20, 0.15): (f) = b(k: 9, 0.70). 
=6 =6 

9. Using Table 1 find 

(a) B(7; 19, 0.45); (b) 5(7; 19, 0.45); 

(c) B(8; 10, 0.95); (d) 5(8; 10, 0.95); 

(ec) »> b(k; 10, 0.35); (f) > b(k; 9, 0.30). 
= =) 


10. Rework the decision problem on page 58, supposing that only 3 of the 20 
f” ashing machines required repairs within the first year. 


1. Assuming that 2 in 10 automobile accidents are due mainly to driver fatigue, 
find the probability that among 8 automobile accidents 3 will be due mainly 
to driver fatigue 
(a) by using the formula for the binomial distribution: 

(b) by using Table 1. 


12. A social scientist claims that only 40 percent of all high school seniors capa- 
ble of doing college work actually go to college. Use Table 1 to find the 
probabilities that among 14 high school seniors capable of doing college 
work 
(a) at most 6 will go to college; 
fb) at least 4 will go to college; 


FA (c) exactly 5 will go to college. 
3 


If the probability is 0.05 that a hockey game will go into overtime, find the 
probabilities that among 12 hockey games 

(a) exactly one will go into overtime; 

(b) at most 2 will go into overtime; 

(c) 2 or more will go into overtime. 


14. If the probability that a fluorescent light has a useful life of at least 500 hours 
is 0.85, find the probabilities that among 20 such lights 
(a) exactly 18 will have a useful life of at least 500 hours; 
(b) at least 15 will have a useful life of at least 500 hours; 
(c) at least 2 will not have a useful life of at least 500 hours. 
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15. A quality control engineer wants to check whether (in accordance with speci- | 
fications) 95 percent of the electronic components shipped by his company 
are in good working condition. To this end, he randomly selects 15 from 
each very large lot ready to be shipped and passes it only if they are a// in 
good working condition; otherwise, each of the components in the lot is 
checked. 

(a) What is the probability that he will commit the error of holding a lot for 
further inspection even though 95 percent of the components are in good 
working condition? 

(b) What is the probability that he will commit the error of letting a lot pass 
through without further inspection even though only 90 percent of the 


components are in good working condition? 
wks 


ood processor claims that at most 10 percent of his jars of instant coffee 
contain less coffee than claimed on the label. To test this claim, 16 jars of his 
instant coffee are randomly selected and the contents are weighed; his claim 
is accepted if fewer than 3 of the jars contain less coffee than claimed on the 
label. Find the probabilities that the food processor’s claim will be accepted 
when the actual percentage of his jars containing less coffee than claimed on 
the label is 
(a) 0.05; (b) 0.10; 


0.15 (d) 0.20 

fe control engineer inspects a random sample of three toasters from 
each lot of 24 toasters that is ready to be shipped. If such a lot contains six 
toasters with slight defects, find the probabilities that the inspector’s sample 
will 
(a) contain only toasters without defects; 
(b) contain exactly one toaster with a defect; 
(c) contain at least two toasters with defects. 


18. If 6 of 18 new buildings in a city violate the building code, what is the prob- 
ability that a building inspector, who randomly selects 4 of the new build- 
ings, will catch 
(a) none of the new buildings that violate the building code; 

(b) exactly one of the new buildings that violate the building code; 
{© exactly two of the new buildings that violate the building code; 
(d) at least three of the new buildings that violate the building code? 


Ag, A collection of 15 gold coins contains four counterfeits. If two of these coins 
are randomly selected to be sold at auction, what are the probabilities that 
(a) neither coin is a counterfeit; 
(b) only one of the two coins is a counterfeit ; 
(c) both coins are counterfeits ? 


( 


20. Among the 16 cities which a professional society is considering for its next 
three annual conventions, 7 are in the Western part of the United States. If, 
to avoid arguments, the selection is left to chance, what are the probabilities — 
that 
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(a) none of these conventions will be held in the western part of the United 
States; 

(b) all of these conventions will be held in the western part of the United 
States ? 


21. A shipment of 120 burglar alarms contains 5 that are defective. If 3 of these 
burglar alarms are randomly selected and shipped to a customer, 
(a) find the probability that he will get exactly one bad unit; 
(b) approximate the probability that he will get exactly one bad unit by 
means of the binomial distribution. 
Also, round both answers to three decimals and find the percentage error of 
the approximation. 


22. Among the 300 employees of a company, 240 are union members while the 
others are not. If 8 of the employees are to be chosen to serve on a committee 
which administers the pension fund, find the probability that five of them will 
be union members while the others are not 
(a) by using the formula for the hypergeometric distribution; 

(b) by approximating the probability with the use of the formula for the 
binomial distribution. 

Also, round both answers to four decimals and find the error of the approxi- 

mation. 


vA THE MEAN AND THE VARIANCE 
OF A PROBABILITY DISTRIBUTION 


Besides the binomial and hypergeometric distributions there are many 
other probability distributions that have important applications. However, 
before we go any further, let us discuss some general characteristics of 
probability distributions. One such characteristic, that of the symmetry 
or skewness of a distribution, was illustrated by means of Figures 3.3 
and 3.4; two others will be presented here with reference to Figure 3.5. 


Example. Figure 3.5 shows the histograms of two binomial distribu- 
tions; the one which is shaded has the parameters n = 16 and p = 4, 
and the one which is not shaded has the parameters n = 4 and p = 4, 
Essentially, these two distributions differ in two respects. The first 
distribution (whose histogram is shaded) is “centered” about x = 8 
while the other is “centered” about x = 2, and we thus say that the 
two distributions differ in their /ocation. Another distinction between 
the two distributions is that the histogram of the first is more “spread 
out,” the probabilities represented by the rectangles are more “dis- 
persed,” and we say that the two distributions differ in variation. 
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Figure 3.5. Histograms of two binomial distributions. 


To make such comparisons more specific, we shall introduce in this 
section two of the most important measures describing, respectively, the 
location and the variation of a probability distribution; they are the mean 


ee of a distribution. 
Continuation of Example. Concentrating for the moment on the second 


of the probability distributions of Figure 3.5, it can easily be verified 
(see Exercise 2 on page 63) that the ea cae for #0, I,. 2,.3, 
or 4 are, respectively, +4, <4, &, 3: and =. Hence, we could say that 
if we repeatedly observed values of this random variable (for instance, 
if we observed the number of heads in tee flips of four coins), 
then in the long run x should aie 0 about 71 the time, it should equal 
1 about =4 of a time, 2 about =& of the Bae 3 about =4 of the time, 
and 4 about 2t: Té of the time. Applying the definition of moatiiematical 
expectation given on page 44, we obtain 


Ong b lets ts *"t¢ op a ae 2 


for the expected number of heads. This result, representing the long- 
run average of the random variable having the binomial distribution 
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with n = 4 and p = 4 is what we referred to as the “center” of the 
distribution; indeed, this average is what we call the mean of the 
probability distribution. 


In general, the mean of a probability distribution with values f(x) is — 
defined by the equation 


U = er (x) 





and it is denoted by the Greek letter uw (mu). It measures the “center” of 
a probability distribution in the sense of an average or, better, in the sense 
of a center of gravity. Note that the above formula for y is, in fact, the 
first moment about the origin of a discrete system of masses f(x) arranged 
on a straight line at distances x from the origin. We do not have to divide 
here by py f(x), as we do in the usual formula for the x-coordinate of the 


center of gravity, since this sum is by definition equal to 1. 


Continuation of Example. Returning to the first distribution of Figure 
3.5, we could find its mean by calculating all the necessary prob- 
abilities or by looking them up in Table 1, and substituting them into 
the formula for 4“. However, if we reflect for a moment, we might 
argue that there is a fifty-fifty chance for a success on each trial: there 
are 16 trials, and it would seem reasonable to expect on the average 
8 successes in 16 trials. Similarly, we might argue that if a binomial 
distribution has the parameters n = 200 and p = 0.20, we can expect 
a success about 20 percent of the time and, hence, on the average 
200(0.20) = 40 successes in 200 trials. 


Indeed, this argument can be extended to any binomial distribution 
and we shall now prove that the mean of the binomial distribution with the 
parameters n and p is given by 





Substituting the expression which defines b(x; n, p) into the formula for y, 
we get 


w= > x- 


=, oe x)? = @ caer) a * 
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Then, making use of the fact that = = =—T and n! = n(n — 1)!, and 
factoring out n and p, we obtain 
—_ . (n —_ 1)! x=4 _. 7 \i-* 
e mp 2a x—Din—x =P) 


where the summation starts with x = 1 since the original summand is zero 
for x = 0. If we now let y = x — 1 and m = n — 1, we obtain 


mt m! 
= : y 1 — m-y 
B oP a ren P) 


and this last sum can easily be recognized as that of all the terms of the 
binomial distribution with the parameters m and p. Hence, this sum 
- equals | and it follows that ~ = np. 

The result obtained applies, of course, only to the binomial distribu- 
tion, but given any other special distribution, we can, similarly, express 
the mean in terms of its parameters. Thus, for the mean of the hyper- 
geometric distribution with the parameters n, a, and N we can write 





and, using a method similar to the one we employed in deriving the for- 
mula for the mean of the binomial distribution, the reader will be asked 
to prove this result in Exercise 6 on page 77. 


Example. Referring to the example on page 62, where 5 of 20 tape 
recorders were defective, we can now say that if 10 of them are chosen 
for inspection, the distribution of the number of defectives has the 
mean 


fb = 10-3 = 2.5 
In other words, if we inspect 10 of the tape recorders, we can expect 
2.5 defectives, where the word “expect” is to be interpreted as a 
mathematical expectation. 


As defined, the mean of any probability distribution is the expected value 
of the corresponding random variable—in accordance with the formula 
for a mathematical expectation, mu is given by the sum of the products 





70 Ch. 3 Probability Distributions 


obtained by multiplying each value of the random variable by the cor- 
responding probability. 

To study the second of the two properties of probability distributions 
mentioned on page 66, the one concerning their variation, let us refer 
again to the two distributions of Figure 3.5. For the distribution whose 
histogram 1s not shaded there is a high probability that we will get a value 
close to the mean; so far as the other distribution is concerned, the one 
whose histogram is shaded, there is a high probability of getting values 
scattered over considerable distances away from the mean. Using this 
property, it may seem reasonable to measure the variation of a probability 
distribution with the quantity 


XE — w-F@) 


namely, the average amount by which the values of the random variable 
deviate from the mean. Unfortunately, 


Be-MSO) =F xe f@—-T we f@) 
=h- MD fX)=H-U=0 


so that this expression is always equal to zero. However, since we are really 
interested only in the magnitude of the deviations x — yu (not in their 
signs), it suggests itself that we average the absolute values of these devia- 
tions from the mean. This would, indeed, provide a measure of variation, 
but on purely theoretical grounds we prefer to work instead with the 
squares of the deviations from the mean. These quantities cannot be 
negative either, and their average is indicative of the spread or dispersion 
of a probability distribution. We thus define the variance of a probability 
distribution with values f(x) as 


0? = Oe — wf) 


where o (sigma) is the lower-case Greek letter for s. This measure is not 
in the same units (or dimension) as the values of the random variable, but 
we can adjust for this disadvantage by taking its square root, thus defining 
the standard deviation 


a= / 3) (x — HYP F(X) 
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Example. To illustrate how the standard deviation measures the 
“spread” of a probability distribution, let us compare the values of 
o of the two distributions of Figure 3.5. Having already shown on 
page 67 that for the distribution with n = 4 the mean is “ = 2, we 
find that the variance of this distribution is 


go? = (0 — 2)? te + (1 — 2)? he + (2 — 2)? 
+ (3 = 2)? + (4 = 2)? ge 
=I 


and, hence, the standard deviation is 0 = 1. Similarly, it can be shown 
that for the other distribution o = 2, and we find that the shaded 
distribution with the greater spread also has the greater standard devia- 
tion. 


Given any probability distribution, we can always calculate a? by sub- 
stituting the corresponding probabilities f(x) into the formula which 
defines the variance. As in the case of the mean, however, this work can be 
simplified to a considerable extent when we deal with special kinds of 
distributions. For instance, it can be shown that the variance of the bino- 
mial distribution with the parameters n and p is given by the formula 


Ln | 


o? =n-p-(1 — p) 


DT 


and that the variance of the hypergeometric distribution with the parameters 
n, a, and N is given by the formula 
ge Ba Fc ee 


Wer a ea (N — a)-(N — 2) 
Ds N2-(N — 1) 
Fe ee ca ae a rg eg ce gel 


We shall not prove these formulas here, but let us apply them to the two 
distributions of Figure 3.5. 


Continuation of Example. Since the two distributions are binomial 
distributions with the parameters n = 4 and p = 4, and n = 16 and 
p = 4, substitution into the formula yields 


o” = 4.1.5 — 1 
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and 
o7 = 16-4-4=4 


and this agrees with the results which we gave before. 


When we first defined the variance of a probability distribution, it 
may have occurred to the reader that the formula looked exactly like the 
one which we use in physics to define second moments, or moments of 
inertia. Indeed, it is customary in statistics to define the kth moment about 
the origin as 


My = x*+ f(x) 
and the kth moment about the mean as 
Ly = Ce pe FO) 


Thus, the mean y is the first moment about the origin, and the variance 
a” is the second moment about the mean. Higher moments are often used 
in statistics to give further descriptions of probability distributions. For 
instance, the third moment about the mean (divided by a3 to make this 
measure independent of the scale of measurement) is used to describe the 
symmetry or skewness of a distribution; the fourth moment about the 
mean (divided by a‘) is, similarly, used to describe its “peakedness,” or 
kurtosis. To determine moments about the mean, it is usually easiest to 
express them in terms of moments about the origin and then to calculate 
the necessary moment about the origin. For the second moment about the 
mean we thus have the important formula 





a = fy, — Ww? 





which the reader will be asked to prove in part (a) of Exercise 11 on page 
77; part (b) of that exercise contains a similar formula for expressing 
4; in terms of moments about the mean. 


Example. To illustrate the use of this “short-cut” formula for com- 
puting the variance, we return to the distribution of the number of 
heads in four flips of a coin illustrated in Figure 3.5. For this distribu- 
tion 4“ = 2 and, applying the formula for yw), with k = 2, we get 


Hy = OPorly + Taide + 2haghy + Parle + dg 
=—h | 
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Thus 
o#=5-—-2=1 


which is the result previously obtained. 


¥ .5  CHEBYSHEV ’S THEOREM 


Earlier in this chapter we used examples to show how the standard 
deviation measures the variation of a probability distribution, that 1s, 
how it controls the concentration of probability in the neighborhood 
of the mean. Ifa is small, there is a high probability for getting a value close 
to the mean; if o is large there is a correspondingly higher probability for 
getting values further away from the mean. Formally, this idea is expressed 
by the following theorem: 





THEOREM 3.2 (Chebyshev’s theorem). If a probability distribution 
has the mean yu and the standard deviation oa, the probability of 
obtaining a value which deviates from the mean by at least k stan- 
dard deviations is at most I/k*. Symbolically, 


1 
P(x—plzko)s 





Example. Thus, the probability that a random variable will take on 
a value which deviates (differs) from the mean by at least 2 standard 
deviations is at most 4, the probability that it will take on a value 
which deviates from the mean by at least 5 standard deviations is at 
most =, and the probability that it will take on a value which deviates 
from the mean by 10 standard deviations or more is less than or equal 


1 
to Too" 


To prove this theorem, consider any probability distribution having the 
values f(x), the mean y, and the variance o?. Dividing the sum defining 
the variance into three parts as indicated in Figure 3.6, we have 


o? =F ( — wf) 
=D — WO) + EE — MF) + LE - HY) 


where R, is the region for which x < wu — ko, R, is the region for which 
u—ko<x<yp+ko, and R; is the region for which x > w+ ko. 
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Figure 3.6. Diagram for proof of Chebyshev’s theorem. 


Since (x — £)*f(x) cannot be negative, the above sum over R, is non- 
negative, and without it the sum of the summations over R, and R; is 
less than or equal to o?, that is, 


7 > De — WH) +L @ — we) 


But x — uw < —ko in the region R, and x — uw > ka in the region R;, so 
that in either case |x — yu| > ko; hence, in both regions (x — pu)? > k?a?. 
If we now replace (x — yu)? in each sum by k?o?, a number less than 


or equal to (x — yz)”, we obtain the inequality 


o* > Ko f(x) + 2, ko? f(x) 


or 


Since p» (x) + 2d f(x) represents the probability that x is in the region 
Ry R,, samely. ‘that |x — “| > ko, we finally have 


P(|x —pl>ka<s5 
and this completes the proof of Theorem 3.2. 

To obtain an alternate form of Chebyshev’s inequality, as the inequality 
of Theorem 3.2 is sometimes called, note that the event |x — u| < ka is 
the complement of the event |x — u| > ko; hence, 


_—. 
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] 
P(x — ul <ko) 21-7 
which states that the probability of getting a value within & standard 
deviations of the mean is at least 1 — a 
Theoretically speaking, the most important feature of Chebyshev’s 
theorem is that it applies to any probability distribution for which u and o 
exist. So far as applications are concerned, however, this generality is also 
its greatest weakness; it only provides an upper limit (often a very poor 
one) to the probability of getting a value which deviates from the mean 
by more than k standard deviations. 


Example. For instance, we can assert in general that the probability 
of getting a value which deviates from the mean by at least two stan- 
dard deviations is not more than 0.25, whereas the corresponding exact 
aoa for the binomial distribution with the parameters n = 16 
and p = + is only 0.0768—“not more than 0.25” is correct, but it does 
not tell c that the actual probability is as small as 0.0768. 


An important theoretical result is obtained if we apply Chebyshev’s 
theorem to the binomial distribution. Substituting w~—=np and go = 
,/np(1 — p) into the alternate form of the theorem given above, we get 


Pllx — np| <ka/mpd =P) = 1 — zp 


x pu = P)) oh 
P(| nn? | = ka/ n =! Ke 
after we divide both terms of the inequality inside the parentheses by x. 


If we now substitute e€ for k | pl 2), we obtain 


Or 


x p(l — p) 
P(|4 p|<e) =P 5B 
and it follows that 


lim P(|= —p|<e) af 
for any € > 0. The last equation expresses the weak law of large numbers. 
In words, as the sample size increases, the probability that the observed 
proportion of successes differs from p by less than any arbitrary positive 
constant € approaches 1. Note that we cannot conclude from this that 
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for large n the actual number of successes must necessarily be close to 
Lt = np. In fact, it can be shown that although the probability that the 
proportion of successes is close to p approaches | as n becomes large, the 
probability that the number of successes is close to mp approaches 0. 


Example. For instance, for 40,000 flips of a vas coin we have 
i = 40,000-4 = 20,000. and o = ./40,0004-4= 100, so that 
Chebyshev’s theorem with k = 10 tells us that the probability is at 
least 0.99 that we will get between 19,000 and 21,000 heads, or that the 
proportion of heads will fall between 0.475 and 0.525. 


ae = (oe LO ae fl a y Pa “ F /, f . [io . ?. £ 


ee: je ES 2 Oe 
go ee EXERCISES 
Fs Suppose that the probabilities are, respectively, 0.4, 0.3, 0.2, and 0.1 that 
there will be 0, 1, 2, or 3 power failures in a certain city during the month of 
July. 
(a) Calculate the mean of this probability distribution and explain its signi- 
ficance. 
(b) Calculate the variance of this probability distribution. 


2. The following table gives the probabilities that a certain computer will mal- 
function 0, 1, 2, 3, 4, 5, or 6 times on any one day: 


Number of 
malfunctions x 0 1 Z 3 4 - 6 


Probability f(x) 0.117 0.29 0.27 0.16 0.07 0.03 0.01 


(a) Find the mean of this probability distribution. 
(b) Find the standard deviation of this probability distribution without 
y using the short-cut formula on page 72. 
Ac) Find the standard deviation of this probability distribution using the 
/ short-cut formula on page 72. 


i. Find the mean and the variance of the uniform probability distribution given 
by 


fe) == or’, te i. ee 


[Hint: The sum of the first n positive integers is 4n(n + 1), and the sum of 
their squares is 4n(n + 1)(2n + 1).] 


A. Find and o? for the binomial distribution which has the parameters n = 5 
and p = 0.70 by using 
(a) the formulas which define “4 and o? on pages 68 and 70 and the prob- 
abilities of Table 1; 
(b) the special formulas on pages 68 and 71. 
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5. Find 4 and a? for the hypergeometric distribution which has the parameters 
n=3,a=4,and N= 8 
(a) by first calculating the necessary probabilities and then using the for- 
mulas which define 4 and a? on pages 68 and 70; 
PF i by using the special formulas on pages 69 and 71. 
V6 


. Prove the formula for the mean of the hypergeometric distribution given on 


page 69, namely, “ = new. [Hint: Make use of the identity 


Ve =) = ("ED 


which can be obtained by equating the coefficients of x* in (1 + x)"(1 + x)s 
and in (1 + x)"*s5,] 


d Construct a table showing (a) the upper limits provided by Chebyshev’s | 
theorem for the probabilities of obtaining values differing from the mean by 
at least 1, 2, and 3 standard deviations, and (b) the corresponding probabil- 
ities for a binomial distribution with the parameters n = 16 and p = 4. 


8. What does Chebyshev’s theorem tell us about the probability of getting 
» anywhere between 30 and 105 threes in 405 rolls of a balanced die : 


/ How many times do we have to flip a balanced coin to be able to assert with 
a probability of at least 0.99 that the difference between the proportion of 
tails and 0.50 will be less than 0.04? 


10. If a random variable has a binomial distribution with the parameter p = 1, 
what can we assert about the proportion of successes with a probability of at 
least 42 when the other parameter is 

f (a) i= 300; 


(b) n = 7,500; 


/ (c) n = 120,000? 
‘11. Prove that 


(a) 0? = LW, — pW; 
(b) Us = My — 3uy-u + 2p}. 


<i 
Ry. 
ta, 


Oh 


‘%3.6 THE POISSON APPROXIMATION 


OF THE BINOMIAL DISTRIBUTION 





When 7 is large and p is small, binomial probabilities are often approxi- 
mated by means of the formula 





janes 


Xx 


tor x = 0, 1, 2, «<: 
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with A (lambda) equal to the product np. Before we prove that this is 
reasonable, let us point out that x = 0, 1, 2, ... means that there is a 
countable infinity of possibilities, and this requires that we modify the 
third axiom of probability given on page 18. In its place we substitute 





Axiom 3’. If A,, A,, A;,...is a finite or infinite sequence of mutual- 
ly exclusive events in S, then 


P(A, U A, U A; U...) = P(A) + P(A,) + P(A3) +... 


while the other postulates remain unchanged; that is, probabilities must 
still be nonnegative, and for the whole sample space the probability must 
be 1. To verify the latter for the formula given above, we make use of 
Axiom 3’ and write 





Mey et at ye 
z=0 xX! 
Since the infinite series in the expression on the right is the Maclaurin 
series expansion of e’, it follows that P(S) = I. 
Let us now show that when n — oo and p — 0, while np = A remains 
constant, the limiting form of the binomial distribution is as given in the 


beginning of the preceding paragraph. First let us substitute A for p into 


the formula for the binomial distribution and simplify the resulting expres- 
sion; thus, we get 


b(x3 2, p) = —- (4 y(1 _ Ay" 


n 





x'\(n — x)! n 
_ n= Weve x4 Day) — A) 
_ 11-2)... 223 
- (oa a ET) o( 7° 


If we now let n — oo, we find that 


(-A-F- 
0-ay"=[0-4y"Y0-4y 





that 
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and, hence, that the binomial probability approaches 





Are” tory = 0, 1.2.44; 


pe 


This completes our proof; the distribution at which we have arrived is 
called the Poisson distribution. Since it has many important applications 
other than approximating binomial probabilities (see Section 3.7), the 
Poisson distribution has been extensively tabulated. Table 2 at the end 


of the book gives the values of the probabilities F(x; A) = py tT (k; 2) 


for values of J in varying increments from 0.02 to 25, and its use is very 
similar to that of Table 1. 

The following are some examples illustrating the Poisson approxima- 
tion of binomial probabilities. An acceptable rule of thumb is to use this 
approximation if m > 20 and p< 0.05; if m > 100, the approximation 
is generally excellent so long as np < 10. 


Example. First let us compare the binomial probability of getting 2 
successes in 100 trials when the probability of a success is 0.05, with 
the corresponding Poisson probability of getting 2 successes when 
np = 100(0.05) = > Substituting into the appropriate expressions, 
we get 


b(2; 100, 0.05) = (3 0, 05)2(0.95)?* = 0.081 


(0:5) =2 ~e~ = 0.084 





and it is interesting to note that the difference (the error we would 
make by using the Poisson approximation) is as small as 0.003, or less 
than 4 percent. Had we used Table 2 (instead of looking up e~* in 
a mathematical table), we would have obtained F(2; 5) — F(1;5) = 
0.125 — 0.040 = 0.085. 


Example. Suppose that a fire insurance company has 3,840 policy- 





holders, and that the probability is that any one of these policy- 


| 
1,200 
holders will file at least one claim during any given year. To find the 
probabilities that during a given year 0, 1, 2, 3, 4, ... of the policy- 
holders will file at least one claim, we can, for practical reasons, 
forget about using the formula for the binomial distribution. Instead 


we use the Poisson distribution with A = 3,840: = 3.2, and, 
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Figure 3.7. Histogram of Poisson distribution with A = 3.2. 


using Table 2 and the identity f(x;4) = F(x; 4) —F — 1;A), 
we get the probabilities shown in the histogram of Figure 3.7. 


Using a method similar to that which we used in deriving the formula 
for the mean of the binomial distribution, it can be shown that the mean 
and the variance of the Poisson distribution with the parameter 4 are given 
by 





pA and of%=2 





The first of these results should really have been expected, for in our proof 
we let A equal np, which is the mean of the corresponding binomial dis- 
tribution. 
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2.7 POISSON PROCESSES 


In general, the term “random process” refers to a physical process 
which is wholly or in part controlled by some sort of chance mechanism. 
It applies to a sequence of repeated flips of a coin, to measurements of 
the quality of manufactured products coming off an assembly line, to the 
vibrations of airplane wings, to the “noise” in a radio signal, and to 
numerous other phenomena. What characterizes such processes is their 
“time-dependence,” namely, the fact that certain events do or do not take 
place (depending on chance) at regular intervals of time or throughout a 
continuous interval of time. 

In this section we shall be concerned with processes taking place over 
continuous intervals of time, such as the occurrence of imperfections on 
a continuously produced bolt of cloth, the recording of radiation by 
means of a Geiger counter, the arrival of telephone calls at a switchboard, 
or the passing by cars of an electronic checking device. We will now show 
that the mathematical model which we can use to describe many situa- 
tions like these is that of the Poisson distribution. To find the probability 
of x successes during a time interval of length T, we divide the interval 
into n equal parts of length At, so that T = n-At, and we assume that: 


1. The probability of a success during a very small interval of time 
At is given by a-At. 

2. The probability of more than one success during such a small 
time interval At is negligible. 

3. The probability of a success during such a time interval does not 
depend on what happened prior to that time. 


This means that the assumptions underlying the binomial distribution (see 
pages 54 and 55) are satisfied, and the probability of x successes in the time 


interval T is given by the binomial probability b(x; n, p) withn = “ and 


p = a-At. Then, following the argument on page 78, we find that when 
n — oo the probability of x successes during the time interval T is given 
by the corresponding Poisson probability with the parameter 2 = n-p = 


. a (a At) = aT. Since A is the mean of this Poisson distribution, note that 


o is the average (mean) number of successes per unit time. 


Example. For instance, if a bank can expect to receive 6 bad checks 
per day, the probability of its getting 4 bad checks on any one day is 


cy. 6t+e7® _ 1,296(0.0025) __ 
f(4;6) = =o = 0.135 
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and the probability of its getting 10 bad checks on any two con- 
secutive days is 


f(10; 12) = oe — 0.105 


where we actually obtained this last value by determining F(10; 12) — 
F(9; 12) with the use of Table 2. 

Also, if in the inspection of tin plate produced by a continuous 
electrolytic process, 0.2 imperfections are spotted on the average per 
minute, the probability of spotting 1 imperfection in 3 minutes is 


F(1; 0.6) — F(0; 0.6) = 0.878 — 0.549 = 0.329 
the probability of spotting at /east 2 imperfections in 5 minutes is 
1 — F(1; 1) = 1 — 0.736 = 0.264 
and the probability of spotting at most 1 imperfection in 15 minutes is 
F(1; 3) = 0.199 


Example. The Poisson distribution has many important applications 
in queueing problems, where we may be interested, for example, in the 
number of customers arriving for service at a cafeteria, the number 
of ships or trucks arriving to be unloaded at a receiving dock, the 
number of aircraft arriving at an airport, and so forth. Thus, if on 
the average 0.3 customers arrive per minute at a cafeteria, then the 
probability that exactly 3 customers will arrive during a 5-minute 
span 1s 


F(B; 1.5) — F(2; 1.5) = 0.934 — 0.809 = 0.125 
and if on the average 3 trucks arrive per hour to be unloaded at a 


warehouse, then the probability that at most 20 will arrive during an 
8-hour day is 


F(20; 24) = 0.243 
Also, if on the average 14.5 planes arrive per day at a private airport, 


then the probability that the number of arrivals on any one day will be 
anywhere from 12 to 15, inclusive, is 


F(15; 14.5) — F(11; 14.5) = 0.619 — 0.220 = 0.399 
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/ 
As THE GEOMETRIC DISTRIBUTION 


In the example on page 7 we indicated that a countably infinite 
sample space would be needed if we were interested in the number of cars 
a person has to inspect until he finds one whose nitrogen-oxide emission 
does not meet government standards. To treat this problem more gener- 
ally, suppose that in a sequence of trials we are interested in the number 
of the trial on which the first success occurs, and that the first, second, and 
fourth assumptions underlying the binomial distribution (pages 54 and 55) 
are satisfied. Clearly, if the first success is to come on the xth trial, it has 
to be preceded by x — | failures, and if the probability of a success is p, 
the probability of x — 1 failures on x — | trials is (1 — p)*~!. Then, if we 
multiply this expression by the probability p of a success on the xth trial, 
we find that the probability of getting the first success on the xth trial is 
given by 


g(x;p)=p(l—p)*"' for x = 1, 2, 3, 4,... 


This probability distribution is referred to as the geometric distribution, 
and, as the reader will be asked to verify in Exercise 19 on page 88, the 
mean of this distribution is given by 


pa 
Pp 


Example. For instance, if the probability that a burglar will get 
caught on any given “job” is 0.20, then the probability that he will 
get caught for the first time on his fourth “job” is 
2(4; 0.20) = (0.20)(0.80)*-! 
= 0.102 
Similarly, if the probability is 0.05 that a part produced by a machine 
will be defective, then the probability that the sixth part produced 
on a given day is the first one that is defective is 
2(6; 0.05) = (0.05)(0.95)5-! 
= 0.039 
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Example. In queueing theory (see example on page 82), the geometric 
distribution has important applications in connection with the number 
of units (customers, trucks, airplanes, etc.) that are being served or 
waiting to be served at any given time. To illustrate, suppose that 
the arrival of cars at a gas station with one pump is a Poisson process: 
with a mean arrival rate (average number of arrivals per unit time) 
of « cars per hour, and that the cars can be served at a mean service rate 
of B cars per hour. Since the arrivals constitute a Poisson process, the 
probability is negligible that there will be more than one arrival during 
a small time interval At, and we shall assume that the same is true 
also for completions of service.* Furthermore, we shall assume that 
a < PB, namely, that the mean number of arrivals per unit time is less 
than the mean number of services that can be completed per unit 
time. 

To arrive at an expression for P(k), the probability that there are k 
cars in the gas station at any given time being served or waiting to be 
served, we shall first obtain a set of “recursion relations” expressing 
P(k) in terms of P(k — 1) and P(k — 2). To illustrate how these rela- 
tions are obtained, let us find the probability that there are no cars 
in the station (k = 0) at time ¢ + At. If k = 0 at time ¢t + At, by the 
assumption of Poisson arrivals the only values that k could have had 
at time ¢ are 0 and 1. If k = 0 at time ¢, then no new cars arrived in the 
interval from t to t + At (with probability 1 — @-At), but if k = 1 
at time ¢, then a service was completed in this interval (with prob- 
ability B-Ar). Using the rule of elimination (see page 37) we, thus, 
conclude that 


P(0) = P(O)(1 — a At) + P(1)B-At 


Or 


P(1) = PO 


In general, the situation is typified by the tree diagram of Figure 3.8, 
and the rule of elimination leads to the following recursion relation: 


P(k) = P(k — 1)a-At + P(ALL — (@ + B)At] + P(k + 1)f-At 


for k > 1. In Exercise 18 on page 88, the reader will be asked to 
verify that a solution to this set of equations is given by the geometric 


*In fact, it will be assumed that so long as the service station is not empty, comple- 
tions of service constitute a Poisson process. 
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Figure 3.8. Tree diagram for queueing problem. 


distribution 
k 
P(k) = (1 — $\(4) fork =0, 1,2, ... 
(x) BNR 
which may be obtained from the expression on page 83 by letting 
O 


= 1]1——andx=k+ 1. 
an 


This distribution reveals many important properties about queues. 
- Note first that since ~ < # by assumption, we have 


lim P(k) = 0 


ko 


so that the probability is 1 that the waiting line remains finite. (In fact, 
it can be shown that if « > # there is a nonzero probability that the 
waiting line will eventually grow beyond all bounds.) The mean 
number of cars in the station (being served and waiting to be served) 
is given by the mean of the above distribution, which is 


a —_ _ tes 
Pp isis a 
B 
in accordance with the formula for uw on page 83. (We subtracted the 
1 because the values of the random variable started with k = 0 
instead of x = 1.) 
Also, the distribution of the numbers in the queue waiting for service 


(thus, excluding the one being served) is given by P(k + 1) fork > 1 
and P(O) + P(1) for k = 0. Thus, the mean number of cars in the 
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queue is given by 


co k+1 2 
1-4) yk (4) payee 
O-B)R&e) =a =H 
To find the mean time a customer spends waiting in the queue, it is 


necessary first to find the distribution of waiting times, and then to find 
its mean. We shall omit this work, stating only that the resulting mean 


waiting time is 


na 
B(B — &) 


EXERCISES 


(A) Prove that for the Poisson distribution 
f@e+1;4__A 
fa) xtl 
yor x = Oy dy Ze 


/ 2, Use the recursion formula of Exercise 1 to calculate the values of the Poisson 
distribution with 2 = 3 for x = 0, 1, 2,..., and 9, and draw the histogram 
/) of this distribution. Verify your results by seferting to Table 2. 


{ h 3./ Using Table 2, find 


IY (a) FA; 7): (b) #(4; 7); 
(c) Lk; 8). 

fa. JUsing Table 2, find 

~—"(a) FQ; 12); (b) F(9; 12); 


© Yfks 75). 


[sy ‘Use the Poisson distribution to approximate the binomial probability 
b(3; 100, 0.03). 


6. A large shipment of books contains 2 percent with imperfect bindings. Use 
the Poisson approximation of the binomial distribution to determine the 
probability that among 600 books (randomly selected from the shipment) 
(a) at most 15 will have imperfect bindings; 

(b) exactly 15 will have imperfect bindings; 
(c) at least 10 will have imperfect bindings. 


(4. In a given city, 5 percent of all drivers get at least one parking ticket per 
year. Use the Poisson approximation of the binomial distribution to deter- 
mine that among 72 drivers (randomly chosen in this city) 


ee ear 
frrt es 
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(a) exactly 4 will get at least one parking ticket during the next year; 

(b) at least 2 will get at least one parking ticket during the next year; 

(c) anywhere from 4 to 8, inclusive, will get at least one parking ticket during 
- the next year. 







¥ The number of weekly breakdowns of a computer is a random variable 
having a Poisson distribution with 2 = 0.3. What is the probability that the 
computer will operate without a breakdown for two consecutive weeks? 


9. The number of gamma rays emitted per second by a certain radioactive sub- 
~ stance is a random variable having the Poisson distribution with 2 = 5.8. If 
a recording instrument becomes inoperative when there are more than 12 
rays per second, what is the probability that this instrument becomes in- 
operative during any given second? 


Zi A A 0. Given that the switchboard of a consultant’s office receives on the average 
, 0.9 calls per minute, find the probabilities that 
(a) in a given minute there will be at least one incoming call; 
(b) between 10: 00A.M.and 10: 02 A.M. there will be exactly 2 incoming calls; 
(c) during an interval of 4 minutes there will be at most 3 incoming calls. 


11. At a checkout counter customers arrive at an average rate of 1.5 per minute. 
Find the probabilities that 
(a) at most 4 will arrive in any given minute; 
(b) at least 3 will arrive during an interval of 2 minutes; 
(c) at most 15 will arrive during an interval of 6 minutes. 


A2. The determination of geometric probabilities is often Spee by making 
: use of the identity 


eee eee 
g(x; p) = x b(1; x, p) Ke 
and looking up (1; x, p) in a table of binomial probabilities. Verify this 
identity and use it (and Table 1) to evaluate 


(a) g(12; 0.10); (b) g(10; 0.30). 


4 , iS 13. An expert shot hits a target 95 percent of the time. What is the probability 
_/ that he will miss the target for the first time on the 15th shot? 


14. In a “torture test” a light switch is turned on and off until it fails. If the prob- 
ability that the switch will fail any time it is turned on or off is 0.001, what is 
the probability that the switch will fail after it has been turned on or off 1,200 
times. Assume that the conditions underlying the geometric distribution are 
met. [Hint: Use the formula for the value of an infinite geometric progression 
and logarithms. ] 


f 
nv) 


(FAS. In the example on page 82 we considered customers arriving at a cafeteria 

at an average rate of 0.3 per minute. If they are being served at an average 

rate of 0.5 per minute, find 

(a) the average number of customers being served or waiting to be served at 
any given time; 
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(b) the average number of customers waiting to be served at any given time; 
(c) the average time a customer spends waiting in line. 


eo 16. The arrival of trucks at a receiving dock is a Poisson process with a mean 
~ arrival rate of 2 per hour. The trucks can be unloaded at a mean rate of 3 per 
hour, and the time required for unloading has an exponential distribution. 
(a) What is the average number of trucks being unloaded and waiting to be 
unloaded ? . | 
(b) What is the mean number of trucks in the queue? 
(c) What is the mean time a truck spends waiting in the queue? 
(d) What is the probability that there are no trucks waiting to be unloaded ? 


. Referring to Exercise 16, suppose that the cost of keeping a truck in the sys- 
tem is $15 per hour. If it were possible to increase the mean unloading rate to 
3.5 trucks per hour at a cost of $100 per day, would this be worthwhile? 


18,. Verify (by substitution) that the geometric distribution is a solution of the 
recursion relation given on page 84. 


9. Differentiating with respect to p both sides of the equation 


dX Pl — pyr te] 
show that the geometric distribution 
F(x) = p(i — p)*"! for ys = 1, 2, Byes 


has the mean 1/p. 


20. The following is an alternate way of introducing the Poisson distribution. 
Suppose that f(x, ¢) is the probability of getting x successes during a time 
interval of length ¢ when (1) the probability of a success during a very small 
time interval from ¢ to ¢ + Aris «- At, (2) the probability of more than one 
success occurring during such a time interval is negligible, and (3) the prob- 
ability of a success during such a time interval does not depend on what 
happened prior to time f¢. 

(a) Show that 


f(x, t + At) = f(x, HL — “At] + f(x — 1, t)o At 


(b) Using the result of part (a), show that 
AMO OV at f(x — 1,1) —f, 0 


/ (c) Verify by substitution that the differential equation obtained in (b) is 
satisfied by the formula for the Poisson distribution with A = at. 


41. Prove that the mean and the variance of the Poisson distribution are both 
~ equal to A 
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(a) directly, by applying the definitions in Section 3.4; 
(b) by using formulas uw = np and o? = np(1 — p) for the binomial distri- 
bution and the limit argument on page 78. 


3 ¢ Le 22. A company rents out time on a computer for periods of ¢ hours, for which it 
receives $600 an hour. The number of times the computer breaks down 
during ¢ hours is a random variable having the Poisson distribution with 
A = (0.8)t, and if the computer breaks down x times during ¢ hours it costs 
50x? dollars to fix it. How should the company select ¢ in order to maximize 
its expected profit? [Hint: Use results obtained in Section 3.6.] 





THE MULTINOMIAL DISTRIBUTION 


An immediate generalization of the binomial distribution arises when 
each trial can have more than two possible outcomes. This happens, for 
example, when a manufactured product is classified as superior, average, 
or poor, when a student’s performance is judged by giving him an A, 
B, C, D, or F, or when an experiment is judged successful, unsuccessful, 
or inconclusive. To treat this kind of problem in general, let us consider 
the case where there are m independent trials, with each trial permitting 
A mutually exclusive outcomes whose respective probabilities are 


k 
P1>P25+++» Dy (with >) p, = 1). Referring to the outcomes as being of the 
i=1 


first kind, the second kind, ..., andthe kth kind, we shall be interested in 
the probability f(x,,x.,..., x,) of getting x, outcomes of the first kind, x, 


outcomes of the second kind, ..., and x, outcomes of the Ath kind, with 
k 


>) x; = n. Using arguments similar to those which we employed in deriv- 
t=1 


ing the equation for the binomial distribution in Section 3.2, it can be 
shown that the desired probability is given by 


n! 


F Bay Koy 2 GR) = X1!x5!+ 


X1 x2 Xk 
e meee PPE 
are Pa 3 


k 
for x, = 0, 1,..., for each i, subject to the restriction 5) x; =n. The 
t=1 


joint probability distribution whose values are given by these probabilities 
is called the multinomial distribution; it owes its name to the fact that for 
the various values of the x; the probabilities are given by corresponding 
terms of the multinomial expansion of (p, + p, + +--+: + >p,)’. 
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Example. Suppose the probability that the light bulb of a certain kind 
of slide projector will last fewer than 40 hours of continuous use is 
0.30, the probability that it will last anywhere from 40 to 80 hours 
of continuous use is 0.50, and the probability that it will last more 
than 80 hours of continuous use is 0.10. To find the probability that 
among eight such light bulbs two will last fewer than 40 hours, five 
will last anywhere from 40 to 80 hours, and one will last more than 80 
hours, we substitute x, = 2, x, =5, x; =1, p; = 0.30, p, = 9.50, 
and p; = 0.20 into the formula, and we get 


fQ,5, 1) = a (0.30)2(0.50)5(0.20)! 


= 0.0945 


EXERCISES 


1. Suppose that the probabilities are, respectively, 0.40, 0.40, and 0.20 that in 
city driving a certain kind of imported car will average less than 22 miles per 
gallon, anywhere from 22 to 25 miles per gallon, or more than 25 miles per 
gallon. Find the probability that among 12 such cars tested, four will average 
less than 22 miles per gallon, six will average anywhere from 22 to 25 miles per 
gallon, and two will average more than 25 miles per gallon. 


2. Suppose that the probabilities are, respectively, 0.60, 0.20, 0.10, and 0.10 that 
an income tax form will be filled in correctly, that it will contain an error 
favoring the taxpayer, that it will contain an error favoring the government, 
or that it will contain both kinds of errors. Find the probability that among 10 
of the income tax forms randomly selected for audit five will be correct, three 
will contain an error favoring the taxpayer, one will contain an error favoring 
the government, and one will contain both kinds of errors. 


3. Using the same sort of reasoning as in the derivation of the formula for the 
hypergeometric distribution, we can derive a formula which is analogous to the 
multinomial distribution but applies to sampling without replacement. If a set 
of N objects contains a, objects of the first kind, a, objects of the second 
kind, ..., and a, objects of the kth kind, so that a; +a, +...+ta,=N, 
the number of ways in which we can select x; objects of the first kind, x, 
objects of the second kind, ... , and x, objects of the Ath kind is given by the 
product of the number of ways in which we can select x; of the a; objects of 
the first kind, x, of the a, objects of the second kind, ..., and x, of the a; 
objects of the kth kind. Then, the probability of obtaining that many objects 
of each kind is simply this product divided by the total number of possibilities, 
namely, the number of ways in which x; + x2 +... +x, =n objects can 
be selected from the whole set of N objects. 
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(a) Write a formula for the probability of thus obtaining x, objects of the 
first kind, x, objects of the second kind, ..., and x; objects of the kth 
kind. 

(b) If 20 defective glass bricks include 10 that have cracks, seven that are 
discolored, and three that have cracks and discoloration, what is the prob- 
ability that among six of the bricks chosen at random for further checks 
three will have cracks, two will be discolored, and one will have cracks as 
Well as discoloration ? 






.10 SIMULATION 


In recent years, simulation techniques have been applied to many 
problems in the various sciences, and if the processes which are being 
simulated involve an element of chance, these techniques are referred to 
as Monte Carlo methods. Very often, the use of Monte Carlo simulation 
eliminates the cost of building and operating expensive equipment; it is 
used, for instance, in the study of collisions of photons with electrons, 
the scattering of neutrons, and similar complicated phenomena. Monte 
Carlo methods are also useful in situations where direct experimentation 
is impossible—say, in studies of the spread of cholera epidemics, which, 
of course, are not induced experimentally on human populations. In 
addition, Monte Carlo techniques are sometimes applied to the solution 
of mathematical problems which actually cannot be solved by direct 
means, or where a direct solution is too costly or requires too much 
time. | 

A classical example of the use of Monte Carlo methods in the solution 
of a problem of pure mathematics is the determination of z (the ratio of 
the circumference of a circle to its diameter) by probabilistic means. 
Early in the eighteenth century George de Buffon, a French naturalist, 
proved that if a very fine needle of length ais thrown at random on a board 
ruled with equidistant parallel lines, the probability that the needle will 
intersect one of the lines is 2a/zb, where 5 is the distance between the 
parallel lines. What is remarkable about this fact is that it involves the 
constant z = 3.1415926 ..., which in elementary geometry is approxi- 
mated by the circumferences of regular polygons enclosed in a circle of 
radius 4. Buffon’s result implies that if such a needle is actually tossed 
a great many times, the proportion of the time it crosses one of the lines 
gives an estimate of 2a/zb and, hence, an estimate of z since a and b are 
known. Early experiments of this kind yielded an estimate of 3.1519 
(based on 5,000 trials) and an estimate of 3.155 (based on 3,204 trials) in 
the middle of the nineteenth century. 

Although Monte Carlo methods are sometimes based on actual gam- 
bling devices (for example, the needle tossing in the estimation of 7), it is 
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usually expedient to use so-called tables of random digits or random 
numbers. Tables of random numbers consist of many pages on which the 
digits 0, 1, 2,..., and 9 are set down in a “random” fashion, much as they 
would appear if they were generated one at a time by a gambling device 
giving each digit an equal probability of being selected. Actually, we 
could construct such tables ourselves—say, by repeatedly drawing num- 
bered slips out of a hat or by using a perfectly constructed spinner—-but 
in practice such tables are usually generated by means of electronic com- 
puters. For instance, in one such method the computer begins with a 4- 
digit number, say, 3571, and squares it, getting 12752041. The middle four 
digits, in this case 7520, are looked upon as a set of 4 random digits. (If 
necessary, a 0 is added on the left to make the square have eight digits.) 
Then 7520 is squared, yielding 56550400, and 5504 is looked upon as the 
next set of 4 random digits. Continuing in this way, we obtain a table of 
pseudo-random digits which are quite satisfactory for most practical pur- 
poses. 

Although tables of random numbers are constructed so that the digits 
can be looked upon as values of a random variable having the discrete 
uniform distribution f(x) = ;, for x =0, 1, 2,..., or 9, they can be 
used to simulate values of any discrete random variable, and even con- 
tinuous random variables. 


Example. For instance, with the use of random numbers we can 
simulate flips of coins in many different ways. One possibility is to let 
0, 2, 4, 6, 8 represent heads, 1, 3, 5, 7, 9 represent tails, and to sim- 
ulate, say, three flips of a coin we can pick sets of three random digits. 
Thus, if we use the second page of Table VII and the digits in the 
9th, 10th, and 11th columns starting from the top, we get 


480 280 085 265 303 288 295 388 127 222 


and we interpret this as getting, respectively, 3, 3, 2, 2, 1, 3, 1, 2,1, and 
3 heads in 10 flips of 3 coins. Since the probabilities of getting 0, 1, 2, or 
3 heads with 3 balanced coins are, respectively, 4, 3, 3, and 4, as can 
easily be verified, we could also use the scheme shown in the following 
table: 


Number Random 
of heads Probability digits 
0 4 0 
1 3 | ea, 
2 3 4,5, 6 
3 t | 
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Omitting the digits 8 and 9 wherever they may occur, we would, thus, 
interpret the random digits 


4811193337636265893106 


in the fifth row of the third page of Table 7 as getting, respectively, 
2, 15-15 1, de 1, 332, 1, 2s 15.2, 2,16 1,0, and 2 heads im 18 tosses -of 
three coins. 


Of the two methods used in this example, the first has the disadvantage 
that we still have to count how many of the digits are even and how many 
are odd; the second has the disadvantage that some of the digits will have 
to be omitted, and this may conceivably lead to situations where many 
random digits will have to be omitted before we find enough that can be 
used. To avoid such difficulties, it is generally preferable to express the 
probabilities as decimals (rounded, if necessary). 


Continuation of Example. Since the probabilities for 0, 1, 2, or 3 
heads in three flips of a coin are, respectively, 0.125, 0.375, 0.375, 
and 0.125, we can use three-digit random numbers and the following 
scheme: 


Number Random 

of heads Probability numbers 
0 0.125 000-124 
1 0.375 125-499 
2 0.375 500-874 
3 


0.125 875-999 


Note that we allocated 125 (or one-eighth) of the random numbers from 
000 through 999 to 0 heads, 375 (or three-eighths) to 1 head, 375 (or 
three-eighths) to 2 heads, and 125 (or one-eighth) to 3 heads. If we 
arbitrarily used the 22nd, 23rd, and 24th columns of the first page of 
Table 7 starting with the 6th row, we would get 


197 565 157 520 946 951 948 568 586 089 


and we would interpret this as getting 1, 2, 1, 2, 3, 3, 3, 2, 2, and 0 
heads. 


The allocation of random numbers to the various values of a random 
variable can be facilitated by referring to the corresponding cumulative 
probabilities, as is illustrated by the following example: 
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Example. Suppose that the probabilities are, respectively, 0.082, 
0.205, 0.256, 0.214, 0.134, 0.067, 0.028, 0.010, 0.003, and 0.001 that 
0, 1, 2,3,..., or 9 cars will arrive at a toll booth of a turnpike in any 
given minute during the early afternoon. To simulate the arrival of 
cars at this toll booth, we might use the following scheme: 


Number Cumulative Random 

of cars Probability probability numbers 
0 0.082 0.082 000-081 
1 0.205 0.287 082-286 
2 0.256 0.543 287-542 
3 0.214 O.757 543-756 
4 0.134 0.891 757-890 
5 0.067 0.958 891-957 
6 0.028 0.986 958-985 
7 0.010 0.996 986-995 
8 0.003 0.999 996-998 
9 0.001 1.000 999 


Note that in each case the /ast random digit is one Jess than the number 
formed by the last three digits of the corresponding cumulative prob- 
ability; of course, the cumulative probabilities are the respective 
probabilities that there will be at most 0 cars, at most 1 car, at most 
2 cars, and so on. Now, if we simulate the arrival of cars at the toll 
booth by using the 5th, 6th, and 7th columns of the fourth page of 
Table 7 starting with the 11th row, we get 


036 417 962 458 778 541 869 379 973 553 
325 674 907 710 709 499 493 384 346 301 


This means that during 20 one-minute intervals the number of cars 
arriving at the toll booth is 0, 2, 6, 2, 4, 2, 4, 2, 6, 3, 2, 3, 5, 3, 3, 2, 2, 
2, 2, and 2. 


. EXERCISES 


wv Suppose that the probabilities are, respectively, 0.41, 0.37, 0.16, 0.05, and 0.01 
that there will be 0, 1, 2, 3, or 4 UFO sightings in a certain region on any one 
day. 

(a) Distribute the two-digit random numbers from 00 through 99 to the five 
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values of this random variable, so that the corresponding random num- 
bers can be used to simulate the sighting of UFO’s in the given region. 
(b) Use the result of part (a) to simulate the sighting of UFO's in the given 


Po region on 25 days. 

. Referring to the example on page 79 and the probabilities shown in Figure 
3.7, distribute the three-digit random numbers from 000 through 999 to the 11 
values of the random variable, and simulate the fire insurance company’s 
experience (that is, that number of policyholders who file at least one claim) 

Lt a period of 20 years. 


. The following is the probability distribution of the number of telephone calls 
that are received by the switchboard of a brokerage house per minute: 


Number of 
calls 


Probability 0.2466 0.3452 0.2417 0.1128 0.0395 0.0111 0.0026 0.0005 


(a) Distribute the four-digit random numbers from 0000 through 9999 to the 
eight values of this random variable, so that the corresponding random 
numbers can be used to simulate the number of calls arriving at the 
switchboard of this brokerage house. 

(b) Use the result of part (a) to simulate the number of calls arriving at the 
switchboard during 40 consecutive minutes. 


4. Depending upon the availability of parts, a company can manufacture 3, 4, 5, 
or 6 units of a certain item per week with corresponding probabilities of 0.10, 
0.40, 0.30, and 0.20. The probabilities that there will be a weekly demand for 
0, 1, 2, 3,..., or 8 units are, respectively, 0.05, 0.10, 0.30, 0.30, 0.10, 0.05, 
0.05, 0.04, and 0.01. If a unit is sold during the week that it is made, it will 
yield a profit of $100; this profit is reduced by $20 for each week that a unit 
has to be stored. Use random numbers to simulate the operations of this com- 
pany for 50 consecutive weeks and estimate their expected weekly profit. 








Probability 
Densities 


AS4.1 CONTINUOUS RANDOM VARIABLES 





When we first introduced the concept of a random variable in Chapter 
3, we presented it as a real-valued function defined over the sample 
space of an experiment, and we illustrated this idea with the random 
variable giving the number of preferred ratings which a lawn mower 
received, assigning the numbers 0, 1, 2, or 3 (whichever was appropriate) 
to the 18 possible outcomes of the experiment. In the continuous case, 
where random variables can assume values on a continuous scale, the 
procedure is very much the same. The outcomes of an experiment are 
represented by the points on a line segment or a line, and the value of a 
random variable is a number appropriately assigned to each point by 
means of some rule or equation. When the value of a random variable is 
given directly by a measurement or observation, we usually do not bother 
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to differentiate between the value of the random variable, the measure- 
ment which we obtain, and the outcome of the experiment, the correspond- 
ing point on the real axis. Thus, if an experiment consists of determining 
what force is required to break a given tensile-test specimen, the result 
itself, say, 138.4 pounds, is the value of the random variable with which 
we are concerned. There is no real need in that case to add that the sample 
space of the experiment consists of all (or part of) the points on the positive 
real axis. 

The problem of defining probabilities in connection with continuous 
sample spaces and continuous random variables involves some com- 
plications. To illustrate the nature of these complications, let us consider 
the following situation: 


Example. Suppose we want to know the probability that if an accident 
occurred on a freeway whose length is 200 miles, it would happen at 
some given location or, perhaps, some particular stretch of the road. 
The outcomes of this experiment can be looked upon as a continuum 
of points, namely, those on the continuous interval from 0 to 200. 
Suppose the probability that the accident occurred on any interval of 
length L is L/200, with L measured in miles. Note that this arbitrary 
assignment of probability is consistent with Axioms | and 2 on page 
18, since the probabilities are all nonnegative and less than or equal 
to 1, and P(S) = 299 = 1. Of course, we are considering so far only 
events represented by intervals which form part of the line segment 
from 0 to 200. Using Axiom 3’ on page 78, we can also obtain prob- 
abilities of events which are not intervals but which can be represented 
by the union of finitely many or countably many intervals. Thus, for 
two nonoverlapping intervals of length L, and L, we have a probability 
of 
L,4+L, 
200 


and for an infinite sequence of nonoverlapping intervals of length L,, 
L,, L;,..., we have a probability of 


L+h+bs+... 
200 


Note that the probability that the accident occurred at any given point 
is equal to zero because we can look upon a point as an interval of zero 
length. However, the probability that the accident occurred in a very 
short interval is positive; for instance, for an interval of length 1 foot 
the probability is 9.5(10)~’. 
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Thus, in extending the concept of probability to the continuous case, 
we again use Axioms 1, 2, and 3’, but we shall have to restrict the meaning 
of the term “event.” So far as practical considerations are concerned, this 
restriction is of no consequence; we simply do not assign probabilities 
to some rather abstruse point sets, which cannot be expressed as the 
unions or intersections of finitely many or countably many intervals. 

The way in which we assigned probabilities in the preceding example 
is, Of course, very special; it is similar in nature to the way in which we 
assign equal probabilities to the six faces of a die, heads and tails, the 52 
cards in a standard deck, and so forth. To treat the problem of associating 
probabilities with continuous random variables more generally, suppose 
we are interested in the probability that a given random variable will 
take on a value on the interval from a to b, where a and 5b are constants 
with a < b. Suppose, furthermore, that we divide the interval from ato 
b into n equal sub-intervals of width Ax containing, respectively, the 
points x;,x2,...,X,, and that the probability that the random variable 
will take on a value on the sub-interval containing x, is given by f(x;)- Ax. 
Then, the probability that the random variable with which we are con- 
cerned will take on a value on the interval from a to 5 is given by 


Pa<x<b)=¥ f(x)-Ax 


Now, if f is an integrable function defined for all values of the random 
variable with which we are concerned (x > 0 in this case), we shall define 
the probability that the value of the random variable falls between a and b 
by letting Ax — 0, namely, as 


PeZ22=2 = [ f@oax 





This definition of probability in the continuous case presupposes the 
existence of an appropriate function f which, integrated from any con- 
stant a to any constant b (with a< bd), gives the probability that the cor- 
responding random variable takes on a value on the interval from a to 
b. Note that a value f(x) of f does not give the probability that the cor- 
responding random variable takes on the value x; in the continuous case 
probabilities are given by integrals and not by the values of f. The probability 
that a random variable actually takes on the value x may be obtained by 
first considering the probability that it takes on a value on the interval 
from x — Ax to x + Ax. However, if we then let Ax — 0, it becomes 


99 4.1 Continuous Random Variables 


apparent that the probability is zero that a continuous random variable 
takes on any given value x. 

The fact that this probability is always zero should not be disturbing. 
In fact, our definition of probability for the continuous case provides a 
remarkably good model for dealing with measurements or observations. 
Owing to the limits of our ability to measure, experimental data never 
seem to come from a continuous sample space. Thus, while temperatures 
are fruitfully thought of as points on a continuous scale, any temperature 
measurement actually represents an interval on this scale. If we report a 
temperature measurement of 74.8 degrees centigrade, we really mean 
that the temperature lies in the interval from 74.75 to 74.85 degrees 
centigrade, and not that it is exactly 74.8000.... It is important to add 
that when we say that there is a zero probability that a random variable 
will assume any given value x, this does not mean that it is impossible 
actually to get the value x. In the continuous case, a zero probability does 
not imply logical impossibility, but the whole matter is largely academic 
since, owing to the limitations of our ability to measure and observe, we 
are always interested in probabilities connected with intervals and not 
with isolated points. 

As an immediate consequence of the fact that in the continuous case 
probabilities associated with individual points are always zero, we find 
that if we speak of the probability associated with the interval from a to 
b, it does not matter whether either endpoint is included; symbolically, 


Pa<x<b)=Pla<x<b)=Pa<x<b)=Pa<x<b) 


Drawing an analogy with the concept of a density function in physics, 
we call the functions f, whose existence we stipulated in extending our 
definition of probability to the continuous case, a probability density 
function, or simply a probability density. Whereas density functions are 
integrated to obtain weights, probability density functions are integrated 
to obtain probabilities. 

Since a probability density, integrated between any two constants 
a and b, gives the probability that a random variable assumes a value | 
between these limits, f cannot be just any real-valued integrable function. 
However, if we impose the conditions that: 


1. f(x) > 0 for all x within the domain of f. 


2. Jf ax = 1. 
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it can be shown that the axioms of probability (with the modification 
discussed on page 98) are satisfied. Note the similarity between these 
properties and those given on page 53 for probability distributions. 

As in the discrete case, we shall write as F(x) the probability that a 
random variable with the probability density f takes on a value less than 
or equal to x, and we shall refer to the corresponding function F as the 
cumulative distribution function, or simply the distribution function of the 
random variable. Thus, if a random variable with values x has the prob- 
ability density f, the values of its distribution function are given by 





F(x) = io f(t) dt 





It immediately follows from this definition that 
P(a<=, X=, b) = Fb) — Fla) 
and according to the fundamental theorem of integral calculus that 


SS we FG 


wherever this derivative exists. 


Example. To illustrate the concepts we have introduced, let us con- 
sider a random variable whose probability density function is given by 


26 ** forx >0 
ne y=) for x= 0 


Note that, although the random variable cannot assume negative 
values, we artificially extended the domain of f to include all the real 
numbers. This is a practice we shall follow throughout this book. It 
is also apparent from the graph of this function in Figure 4.1 that it 
has a discontinuity at x = 0; indeed, a probability density function 
need not be everywhere continuous, so long as it is integrable between 
any two limits a and b (with a < b). 

Leaving it to the reader to verify that the given function satisfies 
the two conditions on page 99 and that it can, therefore, serve as a 
probability density function, let us now find some probabilities con- 
nected with a corresponding random variable. The probability that 
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Figure 4.1. Graph of density function. 


it will take on a value between 1 and 3 is 
3 
Pi<x<3)= 2e-2* dx = e-? — e-§ = 0,135 
1 
and the probability that it will take on a value greater than 0.5 is 
P(x > 0.5) = { 2e-2* dx = e-! = 0,368 
0.5 
The cumulative distribution function which corresponds to this 
example is given by 
0 forx <0 


F — E 
(x) i 2e-* dt = 1 — e=* for0 <x 
0 


and it can be shown, for example, that 
P(x < 1) = F(1) = 1 — e-? = 0.865 


Note that the distribution function of this example is nondecreasing and 
that F(—c) = 0 and F(co) = 1. Indeed, it can be shown that these 
properties are shared by all cumulative distribution functions. 

The measures used to describe probability densities are very similar 
to the ones which we used to describe probability distributions. Replacing 
summations with integrals, we define the kth moment about the origin as 





102 Ch. 4 Probability Densities 





analogous to the definition we gave on page 72. The first moment about 
the origin is again referred to as the mean and it is denoted as Lt; as before, 
it is the expected, or average value of a random variable having the prob- 
ability density f. We also define the kth moment about the mean as 


Me = | &— wf) ax 





In particular, the second moment about the mean is again referred to as 
the variance and it is written as o?; as before, it measures the spread of 
a probability density (or its graph) in the sense that it gives the expected 
or average value of the squared deviations from the mean. 


Continuation of Example. Applying these formulas to our example, 
we find that the mean is 


h= J, ee ees 
and that the variance is 
oz = ii (x — 4)%-2e72* dx = 4 


In an example like this, the calculation of ¢2 may be simplified by using, 
as on page 72, the formula 0? = yw, — p?. 


EXERCISES 


A. Verify that the function given in the example on page 100 is, in fact, a prob- 


ability density. 
I . Fé If the probability density of a random variable is given by 


k(1 — x?) for O-= x ac 1 
f(x) = ‘0 
elsewhere 
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find 
(a) the value of k; 
(b) the probability that a random variable having this probability density 
will take on a value between 0.1 and 0.2; 
(c) the probability that a random variable having this probability density 
sends will take on a value greater than 0.5; 
en (d) the distribution function which corresponds to this probability density. 
to oe [Hint: F(x) must be given separately for x <0,0 <x <1], and x > 1.] 
Aiso use the result of part (d) to find 
(e) the probability that a random variable having this distribution function 
vill take on a value less than 0.3; 
(f) Athe probability that a random variable having this distribution function 


will take on a value between 0.4 and 0.6. 







* If the probability density of a random variable is given by 


hit ¢ 


x for0 <x <1 
J) H42—x% for l= x <2 
0 elsewhere 


find 

(a) the probability that a random variable having this probability density 
will take on a value between 0.2 and 0.8; 

(b) the probability that a random variable having this probability density 
will take on a value between 0.6 and 1.2; | 

A. 5 —-(€) the distribution function which corresponds to this probability density. 

[Hint: F(x) must be given separately forx <0,0<x< 11h = 2, 
and x > 2.] 

Also use the result of part (c) to find 


(d) the probability that a random variable having this distribution function 
will take on a value greater than 1.8; 

(e) the probability that a random variable having this distribution function 
will take on a value between 0.4 and 1.6. 





Ziven the probability density f(x) = for —co < x < o, find k. 





k 
1+ x? 


5, If the distribution function of a random variable is given by 


4 
rafts forx>2 
0 forx <2 


find 

(a) the probability that this random variable will take on a value less than 3; 

(b) the probability that this random variable will take on a value between 4 
and 5; 

(c) the probability density of this random variable. Are there any points at 
which it is undefined? 
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Also sketch the graphs of the distribution function and the probability 
/density. 


6. In certain experiments, the error made in determining the density of a silicon 
compound is a random variable having the probability density 


2 for —0.02 < x < 0.02 
0 elsewhere 


fx) = 


find 
(a) the probability that such an error will be between —0.03 and 0.04; 
Ab) the probability that the magnitude of such an error will be less than 0.005. 


/1. The mileage (in thousands of miles) which car owners get with a certain kind 
of tire is a random variable having the probability density 


fe) por forx > 0 
XxX = 
0 for x <0 


Find the probabilities that one of these tires will last 
(a) at most 10,000 miles; 

(b) anywhere from 16,000 to 24,000 miles; 

(c) at least 30,000 miles. 


8. Prove that the identity o2 = yw, — yw? holds for any probability density for 
which these moments exist. . 


9. Find 4 and o? for the probability density of Exercise 2. 
10. Find yu and o? for the probability density of Exercise 3. 
11. Find yz and o? for the distribution of the random variable of Exercise 5. 


12, Find yz and o for the distribution of the errors of measurement of 
Exercise 6. 


13. Show that for the probability density of Exercise 4, yw’, and, therefore, a? do 
_ hot exist. 
fmonrs Mr /{ 


2 THE NORMAL DISTRIBUTION 
¥ 


Among the special probability densities we shall study in this chapter, 
the normal probability density, usually referred to simply as the normal 
distribution, is by far the most important.* It was studied first in the 
eighteenth century when scientists observed an astonishing degree of 


*The words “density” and “distribution” are often used interchangeably in the 
literature of applied statistics. 
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regularity in errors of measurement. They found that the patterns (dis- 
tributions) they observed were closely approximated by a continuous 
curve which they referred to as the “normal curve of errors” and attributed 
to the laws of chance. The equation of the normal probability density, 
whose graph (shaped like the cross section of a bell) is shown in Figure 
4.2, is 


floss bso) = 5s et) —o<x< oo 


210 





and in Exercises 18 and 19 on page 112, the reader will be asked to verify 
that its parameters y and a are, indeed, its mean and its standard devia- 
tion. 





Figure 4.2. Graph of normal probability density. 


Since the normal probability density cannot be integrated in closed 
form between every pair of limits a and b, probabilities relating to normal 
distributions are usually obtained from special tables, such as Table 3 
at the end of this book. This table pertains to the standard normal distribu- 
tion, namely, the normal distribution which has uw = 0 and o = 1, and 
its entries are the values of 


1 Zz 
a ee eet —4r? 
FQ) = in [. ee dt 


for z = 0.00, 0.01, 0.02, ..., 3.49. To find any probability Pa <z <5), 
where z is the value of a random variable having the standard normal 
distribution, we use the equation P(a < z < b) = F(6) — F(@), and if 
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either a or b is negative, we also have to use the identity F(—z) = 1 — F(z), 
which the reader will be asked to verify in Exercise 17 on page 112. 


Example. For instance, the probability that a random variable having 
the standard normal distribution will take on a value between 0.87 and 
1,28 is 


P(0.87 <z< 1.28) = F(1.28) — F(0.87) 
— 0.8997 — 0.8078 
= 0.0919 


and the probability that it will take on a value between —0.34 and 
0.62 is 


P(—0.34 < z< 0.62) = F(0.62) — F(—0.34) 
= 0.7324 — (1 — 0.6331) 
= 0.3655 


Also, the probability that it will take on a value greater than 0.85 is 
P(0.85 < z) = 1 — F(0.85) 
= 1 — 0.8023 
= 0.1977 


and the probability that it will take on a value greater than —0.65 is 
P(—0.65 < z) = 1 — F(—0.65) 
= 1—[l — F(0.65)] 
= F(0.65) 
= 0.7422 


To use Table 3 in connection with normal distributions other than the 
standard normal distribution, we have only to make use of the following 
theorem: 


THEOREM 4.1 Jf a random variable has the values x and a distri- 
bution with the mean yu and the variance o*, the corresponding 
standardized random variable has the values 


and its distribution has zero mean and unit variance. 





107 4.2 The Normal Distribution 


To prove this result, we have only to set up the integral for the probability 
and we will find that 





P(x <a), make the change of variable z = a 


P(x < a) equals F (A A— tt), which is the corresponding value of the cumu- 


lative distribution ection of the standard normal distribution. 

Thus, if we want to find the probability that a random variable having 
a normal distribution with the mean yw and the variance a? will take on 
a value between a and b, we have only to calculate the probability that a 
random variable having the standard normal distribution will take on 





a value between = and a ; symbolically, 


racxabh=r(hs#)—A2Z4) 


oO 








where F (? i ) and F (2— 4) can be obtained from Table 3. 
Example. It has been found through experience that a large variety 
of physical measurements have approximately normal distributions. 
Thus, suppose that the amount of cosmic radiation to which a person 
is exposed while flying by jet across the United States is a random 
variable having a normal distribution with mean of 4.35 mrem* 
and a standard deviation of 0.59 mrem. Then, the probability that 
a person will be exposed to anywhere from 4.00 to 5.00 mrem on such 
a flight is 


5.00 — 4.35 4.00 — 4.35) _ ee 

F(200 439) Z FAS a) — F(1.10) — F(—0.59) 
0.8643 = (1 +0922) 
= 0.5867 


and the probability that a person will be sipesed to at least 5.50 mrem 
on such a flight is 


5.50 — 4.35 
i Fe) = 1 _ F(1.95) 
— | — 0.9744 
— 0.0256 


*This unit of radiation stands for “milli roentgen equivalent man.” 
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Example. To consider a somewhat different example, suppose that 
the actual amount of instant coffee which a filling machine puts into 
4-ounce jars varies somewhat from jar to jar, and that it can be looked 
upon as a random variable having a normal distribution with a 
standard deviation of 0.04 ounce. If only 2 percent of the jars are to 
contain less than 4.00 ounces, let us determine the average amount 
of coffee which the filling machine will have to put into these jars. This 
example differs from the preceding one insofar as we are given a 
percentage (or probability), we are given o = 0.04, and we are asked 
to find yz. Since the value of z for which the entry in Table 3 comes 
closest to 1.0000 — 0.0200 = 0.9800 is 2.05, we have 


4.00 — u 


and, solving for uw, we get uw = 4.082 ounces. Since this may be un- 
satisfactory so far as the food processors are concerned, the reader 
will be asked to show in Exercise 14 on page 112 that if the variability 
of the filling machine is reduced so that o = 0.025 ounce, this will 
lower the required average amount of coffee to 4 = 4.05 ounces, yet 
keep just about 98 percent of the jars above four ounces. 


2€4.3 THE NORMAL APPROXIMATION 
OF THE BINOMIAL DISTRIBUTION 


In some books the normal distribution is introduced as a probability 
density which can be used to approximate the binomial distribution when 
n is large and P, the probability of a success, is close to 0.50 and, hence, 
not small enough to use the Poisson approximation. Thus, let us state, 
without proof, the following theorem: 


THEOREM 4.2. If x is a value of a random variable having the bino- 
mial distribution with the parameters n and p, and if 


es 
»/np(l — p) 


then the limiting form of the distribution function of this standard- 
ized random variable as n — oo is given by 


F(z) = sage a 00 < Z < ce 


Zz 
— oo 
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Note that although x takes on only the values 0, 1, 2,...,, in the limit 
as n—> oo the distribution of the corresponding standardized random 
variable is continuous, and the corresponding density is the standard 
normal density. It will be explained in the examples which follow how 
we account for the fact that a probability distribution is approximated 
by means of a continuous probability density. 


Example. Suppose that 20 percent of the diodes made in a certain 
plant are defective, so that they have to be inspected carefully before 
being shipped. Let us find the probability of obtaining at most 15 
rejects (defectives) in a lot of 100 chosen for inspection. Assuming 
that the assumptions underlying the binomial distribution are met, the 
exact answer is given by 


5" b(x; 100, 0.20) = B(15; 100, 0.20) 
x=0 


Since Table 1 does not include values of n as large as 100, let us appro- 
ximate this probability by using the normal distribution with the 
mean “ = np = 100(0.20) = 20 and the variance ao? = np(1 — p) = 
100(0.20)(0.80) = 16. Observe from Figure 4.3 that we are actually 
approximating the sum of the areas of the first 16 rectangles (cor- 





Figure 4.3. Normal approximation of binomial distribution. 
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responding to x = 0, 1, 2,..., 15) of the histogram of the binomial 
distribution by means of the shaded area under a continuous curve. 
Thus, we must find the area under the approximating normal curve 
not between 0 and 15, but between —0.5 and 15.5 or, what is practically 
the same, between —co and 15.5. Making this continuity correction, 
we obtain 


B(15; 100, 0.20) F(BE =) 


4 
= F(—1.13) = 0.1292 


which is close to the exact value B(15; 100, 0.20) = 0.1285 (obtained 
from the H. G. Romig table referred to in the Bibliography at the 
end of the book). 
Had we been interested in the probability of getting exactly 15 
defectives, we would make the continuity correction of representing 15 
- by means of the interval from 14.5 to 15.5, thus getting 
b(15: 100, 0.20) ~ r(P2) = p(A2*) 


= F(—1.13) — F(—1.38) 
= 0.0454 


This is fairly close to the exact value 6(15; 100, 0.20) = 0.0481 
(obtained again from the H. G. Romig table). 


Note that whenever the normal distribution is used to approximate the 
binomial distribution, each x is replaced by the corresponding interval 
from x —4 to x + 4; this is the continuity correction referred to in the 
above examples. 


Arno 
/ EXERCISES 


A a random variable has the standard normal distribution, find the prob- 
ability that it will take on a value 


(a) less than 1.50; (b) less than —1.20; 
(c) greater than 2.16; (d) greater than —1.75; 
(e) between O and 3.00; (f) between 1.22 and 2.35; 


(g) between —1.33 and —0.33; (h) between —1.60 and 1.80. 


111, 4.3 The Normal Approximation of the Binomial Distribution 


Alig 4 Find z if the probability that a random variable having the standard normal 
distribution will take on a value 
(a) less than z is 0.9911; 
(b) greater than z is 0.1093; 
(c) greater than z is 0.6443; 
(d) less than z is 0.0217; 
(e) between —z and z is 0.9298. 


Ah . 2. 3. If a random variable has a normal distribution, what is the probability that 
it will take on a value 
(a) within one standard deviation of the mean; 
(b) within two standard deviations of the mean; 


) within three standard deviations of the mean? 
W/ af. // 1/ ‘tr ,7 
A. Given a random variable having the normal distribution with w = 65 and 


o = 20, find the probabilities that it will take on a value 

(a) less than 53.2; (b) greater than 70.0; 

(c) between 83.2 and 95.7; (d) between 61.2 and 68.8; 
(e) between 35.6 and 45.6; 

(f) less than 58.6 or greater than 70.9. 


5. Given a random variable having the normal distribution with the mean 16.2 
and the variance 1.5625, find the probabilities that it will take on a value 


(a) greater than 16.8; (b) less than 14.9; 
a between 13.6 and 18.8; (d) between 16.5 and 16.7. 
. Ifarandom variable has the binomial distribution with n = 20 and p = 0.60, 


use the normal curve approximation to determine 

(a) the probability that it will take on the value 14; 

(b) the probability that it will take on a value less than 12. 

Compare both results with the corresponding probabilities obtained with the 
use of Table 3. 


7. A manufacturer knows that on the average 2 percent of the electric toasters 
which he makes will require repairs within 90 days after they are sold. Use 
the normal approximation of the binomial distribution to determine the 
probability that among 1,200 of these toasters at least 30 will require repairs 

ithin the first 90 days after they are sold. 


y. ‘34 4. The probability that an electronic component will fail in less than 1,000 
hours of continuous use is 0.25. Use the normal approximation to find the 
probability that among 200 such components fewer than 45 will fail in less 

than 1,000 hours of continuous use. 


vy ; 2 g 9. A safety engineer feels that 30 percent of all industrial accidents in his plant 
are caused by failure of employees to follow instructions. If this figure is 
correct, find, approximately, the probability that among 84 industrial acci- 
dents in this plant anywhere from 20 to 30 (inclusive) will be due to failure of 
employees to follow instructions. 


10. If 3 of all clouds seeded with silver iodide show spectacular growth, what is 
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A. 


4. 1G 12. 


dr 


14. 


16. 


the probability that among 45 clouds thus seeded at most 25 will show spec- 
tacular growth? 


The burning time of an experimental rocket is a random variable having the 
normal distribution with “4 = 4.76 seconds and a = 0.04 second. What is 
the probability that this kind of rocket will burn 

(a) less than 4.66 seconds; 

(b) more than 4.80 seconds; 

(c) anywhere from 4.70 to 4.82 seconds ? 


In a photographic process, the developing time of prints may be looked upon 
as a random variable having the normal distribution with a mean of 16.28 
seconds and a standard deviation of 0.12 second. Find the probability that it 
will take 

(a) anywhere from 16.00 to 16.50 seconds to develop one of the prints; 

(b) at least 16.20 seconds to develop one of the prints; 

(c) at most 16.35 seconds to develop one of the prints. 


. Specifications for a certain job call for washers with an inside diameter of 


0.300 + 0.005 inch. If the inside diameters of the washers supplied by a given 
manufacturer may be looked upon as a random variable having the normal 
distribution with 4“ = 0.302 inch and @ = 0.003, what percentage of these 
washers will meet specifications ? 


Verify the claim made in the example on page 108, namely, that if the vari- 
ability of the filling machine is reduced so that @ = 0.025 ounce, this will 
lower the required average amount of coffee to 4.05 ounces, yet keep 98 per- 
cent of the jars above four ounces. 


. A stamping machine produces can tops whose diameters are normally dis- 


tributed with a standard deviation of 0.01 inch. At what “nominal” (mean) 
diameter should the machine be set so that no more than 5 percent of the 
can tops produced have diameters exceeding 3 inches? 


Extruded plastic rods are automatically cut into nominal lengths of 6 inches. 

Actual lengths are normally distributed about a mean of 6 inches and their 

standard deviation is 0.06 inch. 

(a) What proportion of the rods exceeds tolerance limits of 5.9 inches to 
6.1 inches ? 


/ (b) To what value does the standard deviation need to be reduced if 99 per- 


AT. 
18. 


19. 


20. 


cent of the rods must be within tolerance? 
Verify the identity F(—z) = 1 — F(z) given on page 106. 


Verify that the parameter y in the expression for the normal density on page 
105 is, in fact, its mean. 


Verify that the parameter a? in the expression for the normal density on 
page 105 is, in fact, its variance. 


Show that the normal density has a relative maximum at x = yu and inflec- 
tion points atx =uU+toa. 
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<7y4.4 OTHER PROBABILITY DENSITIES 





In the application of statistics to problems in engineering and physical 
science, we shall encounter many probability densities other than the 
normal distribution. Among these are the ¢t, F, and chi-square distributions, 
the fundamental sampling distributions which we shall introduce in Chapter 
6, and the exponential and Weibull distributions, which we shall apply 
to problems of reliability and life testing in Chapter 15. In the remainder 
of this chapter we shall discuss five continuous distributions, the uniform 
distribution, the log-normal distribution, the gamma distribution, the beta 
distribution, and the Weibull distribution, for the twofold purpose of giving 
further examples of probability densities and of laying the foundation for 
future applications. 


4.5 THE UNIFORM DISTRIBUTION 


The uniform distribution with the parameters « and f is defined by 
the equation 





] 
f(x) =4h —& 


0 elsewhere 


fore = ax = 6 


and its graph is shown in Figure 4.4. Note that all values of x from « to 
B are “equally likely” in the sense that the probability that x lies in a 





Figure 4.4. Uniform density function. 
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narrow interval of width Ax entirely contained in the interval from « to 
B is equal to Ax/(B — a), regardless of the exact location of the interval. 


Example. To illustrate how a physical situation might give rise to a 
uniform distribution, suppose that a wheel of a locomotive has the 
radius r and that x is the location of a point on its circumference 
measured along the circumference from some reference point 0. When 
the brakes are applied, some point will make sliding contact with the 
rail, and heavy wear will occur at that point. For repeated application 
of the brakes, it would seem reasonable to assume that x is a value 
of a random variable having the uniform distribution with « = 0 
and # = 2zr. If this assumption were incorrect, that is, if some set 
of points on the wheel made contact more often than others, the 
wheel would eventually exhibit “flat spots” or wear out of round. 


To determine the mean and the variance of the uniform distribution, 
we first evaluate the two integrals 


ee ti 1 _a+f 
uy = | x-gh ode y) 








and 
B l oz +a 2 
Hy = | xt ogee = ETE 


Thus, 





and, making use of the formula 0? = yw’, — y?, we find that 


o* = (8 — a)? 
/ 
A 
4.6 THE LOG-NORMAL DISTRIBUTION 


The log-normal distribution occurs in practice whenever we encounter 
a random variable which is such that its logarithm has a normal dis- 
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tribution. This probability density function is given by 


—1 g- (in x-a)2/2p2 forx >0,8B>0 


1 
f(x) = 4n/ 208” 


0 elsewhere 





where “In x” stands for the natural logarithm of x. A graph of the log- 
normal distribution with 7 = 0 and # = 1 is shown in Figure 4.5, and it 
can be seen from the figure that this distribution is positively skewed. 





Figure 4.5. Log-normal density function. 


To find the probability that a random variable having the log-normal 
distribution assumes a value between a and b (0 < a < b), we write 


b 
P(a= x= )) = | apt Sey TEP Gog 


Changing variables by letting y = In x and, hence, dy = x~! dx, we obtain 


In b | 
Pla<x<6)= | Ae mt/ae g 
( = ) Ina 278 4 





and it can be seen that this probability equals the probability that a 
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random variable having the normal distribution with uw = «@ anda = B 
assumes a value between In a and In b. Thus, 


Pla<x<b)= r(22 2) - r(na =a) 


where F(z) is the probability that a random variable having the standard 
normal distribution assumes a value less than or equal to z. 


Example. To illustrate the use of the log-normal distribution, suppose 
that a series of experiments have shown that the current gains of certain 
transistors (which are proportional to the logarithm of J,/J,, the ratio 
of the output to the input current) are normally distributed. If current 
gain 1s measured in units for which it equals In (UJ,/J;), and if it is 
normally distributed with w= 2 and o* = 0.01, then the probability 
that J,/Z;, which thus has the log-normal distribution with « = 2 and 
B = 0.1, will take on a value between 6.1 and 8.2 is given by 


I, _pfln8.2—2\_ ,fin6.1—2 
P(6.1<%<8.2) = F/ 0.1 F( 0.1 


— F(1.0) — F(—2.0) 
= 0.8185 


To find the mean of the /og-normal distribution, we write 


= 
Fc 4 — (In x—0)?/2B2 d 
MX "SE x 


and upon letting y = In x, this becomes 


1 as 
a gece ya— (y—a)*/2 p* 
lu Timp [. eve dy 


This integral can be evaluated by completing the square on the exponent 
y — (vy — @)?/282, thus obtaining an integrand which has the form of a 
normal density. The final result, which the reader will be asked to verify 
in Exercise 3 on page 123, is | 


Lt — ett B/2 
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Similar, but more lengthy, calculations yield 
o2 = e2%+B?(ef? = 1) 


for the variance of the log-normal distribution. 


Continuation of Example. For the distribution of J,/J, (the ratio 
of the output to the input current), we thus find that 


A= e24(0.1)7/2 7 4 


and 


go? = ett(0.1)* 90.1)? _ | — 0,56 
va 


A wm GAMMA DISTRIBUTION 


Several important probability densities for which we shall discuss 
applications later on are special cases of the gamma distribution, whose 
equation is given by 


] 
ped ‘a (@)* 


0 elsewhere 


aed rae forx >0,a>0,8R>0 


where I'(«) is a value of the gamma function, defined by 
(a) = { gee eo Oe 
0 


Integration by parts shows that 
Tia) = (@ — IT — 1) 


for any « > 0 and, hence, that ['(«) = (a — 1)! when «@ is a positive inte- 
ger. Graphs of several gamma distributions are shown in Figure 4.6 
and they exhibit the fact that these distributions are positively skewed. 
In fact, the skewness decreases as a increases for any fixed value of f. 
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Figure 4.6. Gamma density functions. 


The mean and the variance of the gamma distribution may be obtained 
by making use of the gamma function and its special properties men- 
tioned above. For the mean we have 


—_ l . vel p—x/P 
A= aoe | mex eFax 


and after letting y = x/8 and, hence, dy = dx/B, we get 


= ieee ao — PI + 1 
a= rhs | PAT 


Then, making use of the identity ['(# + 1) = «-I(a), we arrive at the 
result that 


w= oe 


Using similar methods and the identity 0? = yw, — yw?, it can also be shown 
that the variance of the gamma distribution is given by 
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In the special case where « = 1, we get the exponential density, whose 
equation is thus 







I —x/B 
po forx >0,B >0 
ioe z 


elsewhere 






and whose mean and variance are u = 8 and o” = f?. Note that the 
distribution of the example on page 100 was an exponential distribution 
with B = 4. 

The exponential distribution has many important applications; for 
instance, it can be shown that in connection with Poisson processes (see 
Section 3.7) the waiting time between successive arrivals (successes) have 
exponential distributions. More specifically, it can be shown that if in 
a Poisson process the mean arrival rate (average number of arrivals per 
unit time) is «, the time until the first arrival, or the waiting time between 


successive arrivals, has an exponential distribution with B = oa (see 


Exercise 11 on page 124). 


Example. With reference to the example on page 82, where on the 
average 3 trucks arrived per hour to be unloaded at a warehouse, 
we can now say that on the average there is a waiting time of 4 of an 
hour, or 20 minutes, between the arrival of successive trucks. Also, 
the probability that the time between the arrival of successive trucks 
is less than 5 minutes is 


1/12 
| 30-3* dy = 1 — e-/* = 0.221 


0 


and the probability that it will exceed 45 minutes is 


” 39-3% dy = e-9/4 = 0.105 


3/4 


4s THE BETA DISTRIBUTION 


In Chapter 9 we shall need a probability density for a random variable 
which takes on values on the interval from 0 to 1, and most appropriate 
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for this purpose is the beta distribution, whose equation is given by 


T@+ £) a-1l __ Bp-1 
ja=T@agy 8) Sesh 


0 elsewhere 





The mean and the variance of this distribution are given by 


O 2 __ ap 
gp NS = Gaya pe) 





= 


Note that for 7 = 1 and 6 = 1 we obtain a special case of the uniform 
density of Section 4.5, defined on the interval from 0 to 1. The following 
example, pertaining to a percentage or proportion, illustrates a typical 
application of the beta distribution: 


Example. Suppose that in a certain county the proportion of highway 
sections that require repairs in any given year is a random variable 
having the beta distribution with a = 3 and 6 = 2. The mean of this 
3 
a 0.60, 
which means that on the average 60 percent of the highway sections 
require repairs in any given year. To find a probability connected with 
this distribution, let us determine, say, the probability that at most 
50 percent of the highway sections will require repairs in a given year. 
Substituting « = 3 and # = 2 into the formula for the beta distribu- 
tion and making use of the fact that (5) = 4! = 24, [(3) = 2! = 2, 
and I\(2) = 1! = 1, we get 


distribution, whose graph is shown in Figure 4.7, is u = 


12x*?(1 — x) ior 0 =< x <= J 
0 elsewhere 


fos) = | 
Thus, the desired probability is given by 
1/2 
{ 12x*(1 — x) dx = 7 
0 


In general, probabilities relating to beta distributions are determined with 
the use of special tables. 
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Figure 4.7. Beta distribution with « = 3 and f = 2. 






24.9 THE WEIBULL DISTRIBUTION 


Closely related to the exponential distribution is the Weibull distribu- 
tion, whose probability density is given by 


aBxP-te-%**¥ = forx >0,4>0, RB >0 


0 elsewhere 


f= 





To demonstrate this relationship, let us write the probability that a random 
variable having the Weibull distribution will take on a value less than 
a as 


P(x <a) = | ax!" dx 
0 
Then making the change of variable y = x’ so that dy = Bx*—' dx, we get 
ah Z 
Pi <a) = Py = 2")=— { ae dy = 1 — e~*4 
0 


and it can be seen that y is a value of a random variable having an expo- 
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Figure 4.8. Weibull distributions with « = 1 and # = 3, 1, and 2. 


nential density. The graphs of several Weibull distributions having « = 1 
and, respectively, 6 = 4, B = 1, and # = 2 are shown in Figure 4.8. 

The mean of the Weibull distribution having the parameters « and # 
may be obtained by evaluating the integral 


 — | x-aBxP- ters? dx 
0 
Making the change of variable u = ax’, we get 


0 


and recognizing the integral asT'( 1 + ES ,namely, as a value of the gamma 


B 


function which we defined on page 117, we find that the mean of the 
Weibull distribution is given by 





b= ao PD l+— 
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Using a similar method to determine first w>, the reader will be asked to 
show in Exercise 18 on page 125 that the variance of this distribution 1s 
given by 


ot = or*er(1 +) — [rite] ]t 


Example. Suppose that the lifetime of a certain kind of battery (in 
hours) may be looked upon as a random variable having the Weibull 
distribution with « = 0.1 and B = 0.5. We thus find that such batteries 
last on the average 


u = (0.1)-2F(3) = 200 hours 


and if we wanted to know the probability that such a battery will last 
more than 300 hours, we would get 


P(x > 300) = i” (0.05)x~0-5e79-12° de 


= 0.177 


eo. 1(300)°-5 


eee U.69 EXERCISES 


Lr the cumulative distribution function which corresponds to the uniform 
: density. 


A Yy 2. From experience Mr. Harris has found that the low bid on a construction job 
iz 4 can be regarded as a random variable having the uniform density 


f= Zn for 2S <x <2€ 
0 elsewhere 


where C is his own estimate of the cost of the job. What percentage should 
Mr. Harris add to his cost estimate when submitting bids to maximize his 
ZZ pected profit? 


(3, erify the expression given on page 116 for the mean of the log-normal 
distribution. 


A 


1S 
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6. 


Te 


(5) 
‘3 
X 


B a random variable has the log-normal distribution with o = —1 and 
= 2, find 
i the mean and the standard deviation of this distribution; 
(b) the probability that this random variable will take on a value between 
3.2 and 8.4; 
(c) the probability that this random variable will take on a value greater 
than 5.0. 


Verify the expression for the variance of the gamma distribution given on 
page 118. 


Show that when & > 1, the graph of the gamma density has a relative maxi- 
mum at x = B(« — 1). What happens when 0 < & < 1 and when & = 1? 


If a random variable has the gamma distribution with « = 2 and # = 2, 
find 

(a) the mean and the standard deviation of this distribution; 

(b) the probability that the random variable will take on a value less than 4. 


8. 8. jin a certain city, the daily consumption of electric power (in millions of 
\/ kilowatt- hours) can be treated as a random variable having a gamma distri- 


11. 


12. 





10. : If the number of days a watch will run without having to_be reset is a random 


\ variable having the exponential distribution with B = 50, find the probability 


bution with « = 3 and B = 2. If the power plant of this city has a daily 
capacity of 12 million kilowatt-hours, what is the probability that this power 
supply will be inadequate on any given day? 


. If nm salesmen are employed in a door-to-door selling campaign, the gross 


sales volume in thousands of dollars may be regarded as a random variable 
having the gamma distribution with & = 100,/n and f = 4. If the sales 
costs are $5,000 per salesman, how many salesmen should be employed to 
maximize the profit? 


that 
(a) such a watch will have to be reset in less than 20 days; 
(b) such a watch will not have to be reset in at least 60 days. 


Given a Poisson process with on the average & arrivals per unit time, find 

(a) the probability that there will be no arrivals during a time interval of 
length f; 

(b) the probability that the waiting time between arrivals will be at least of 
length ¢. 

Having thus obtained the distribution function for the waiting time between 

successive arrivals, find 

(c) the corresponding probability density by differentiating with respect to ¢. 


With reference to Exercise 8 on page 87, find the percent of the time that 
the interval between breakdowns of the computer will 

(a) be less than one week; 

(b) exceed 5 weeks. 


125 4.10 Joint Probability Densities 


13. With reference to Exercise 10 on page 87, find the probability that the time 
between successive calls arriving at the switchboard of the consulting firm 
will 
(a) be less than half a minute; 

b) exceed 3 minutes. 







erify for & = 3 and PB = 3 that the integral of the beta density, from 0 to 1, 
is equal to 1. 


15. Show that when « > 1 and f > 1, the beta density has a relative maximum 
at 


ppg ON 
a+ Bp—2 


16. Suppose that the proportion of defectives shipped by a vendor, which varies 
somewhat from shipment to shipment, may be looked upon as a random 
variable having the beta aistribution with & = 1 and B = 4. 

(a) Find the mean of this beta distribution, namely, the average proportion 
of defectives in a shipment from this vendor. 

(b) Find the probability that a shipment from this vendor will contain 25 
percent or more defectives. 


17. If the annual proportion of erroneous income tax returns filed with the 
I.R.S. can be looked upon as a random variable having a beta distribution 
with & = 2 and f = 9, what is the probability that in any given year there 
will be fewer than 10 percent erroneous returns ? 


/ (8. erify the formula for the variance of the Weibull distribution given on page 
Mead 25 


19. Suppose that the time to failure (in minutes) of certain electronic components 
subjected to continuous vibrations may be looked upon as a random variable 
having the Weibull distribution with « = 4 and f = 4. 

(a) How long can such a component be expected to last? 
(b) What is the probability that such a component will fail in less than 5 
hours. 


/20. Referring to the example on page 123, find the probability that such a battery 
\ will not last 100 hours. 


> 4.10 JOINT PROBABILITY DENSITIES 





There are many situations in which we describe the outcome by giving 
the values of several random variables. For instance, we may measure the 
weight and the hardness of a rock, the volume, pressure, and temperature 
of a gas, or the thickness, color, compressive strength, and potassium 
content of a piece of glass. If x;, x.,..., x, are the values of k random 
variables, we shall refer to a function f with values f(x,, X2,..., %,) as 


126 Ch. 4 Probability Densities 


the joint probability density of these random variables if the probability 
that @, = x; = Dy, Qs = Xs = Dg, 1s, and a, = X;, = Bx Is given by the 
multiple integral 


bi phe bx 
{ | wine | TM pg ey vw ge Re) ay Uy. cs 


Qa a2 





Thus, not every function f with values f(x,, x,,...,X,) can serve as a 
joint probability density, but if we impose the conditions that: 


1. f(%1, X%2,...,X,) = 0 for all values of x;,X2,..., Xx, for which 
the joint probability density is defined 


of ee ee oes Xp) dX, dx... dx, = 1 





it can be shown that the axioms of probability (with the modifications 
discussed in Section 4.1) are satisfied. 

To extend the concept of a cumulative distribution function to 
the k-variable case, we write as F(x,, X,,...,X;,) the probability that the 
first random variable will take on a value less than or equal to x,, the 
second random variable will take on a value less than or equal to x,,..., 
and the kth random variable will take on a value less than or equal to x,, 
and we refer to the corresponding function F as the joint distribution 
function of the k random variables. 


Example. To illustrate the various concepts we have introduced, let 
us consider the joint probability density given by 


he re for x, > 0, x5. > 0 
f(X1, X2) = 


0 elsewhere 


Integrating between the appropriate limits, we thus find that the 
probability of the first random variable taking on a value between 1 
and 2 and the second taking on a value between 2 and 3 is given by 


2 
{ {. 6e7 281-32 dy = (e72 — e-4)\(e~5 — e79) 
1 2 
= 0.0003 


127 4.10 Joint Probability Densities 


and that the probability of the first random variable taking on a value 
less than 2 and the second taking on a value greater than 2 is given by 


i \ 6e~2*1-31 dx = (1 — e*)e~® 
0 42 
= 0.0025 


Also, by definition the joint distribution function of the two random 
variables is given by 


x1 x2 gee 
F(x; Xy) =< if \. 6e du dv for x1 a 0, X» > 0 


0 elsewhere 
or 
(1 — e-?*)(1 — e7?*) for x, = 0, X5 = 0 
F(x1; X) = 
0 elsewhere 


and the probability that both random variables will take on a value 
less than 1, for example, is given by 
F(1, 1) = 0 — e77) — e7) 
= 0.8216 
Given the joint probability density of k random variables, the prob- 


ability density of the ith random variables can be obtained by integrating 
out the other variables; symbolically, we thus write 


i —_——_—— 


fix) = i es i. Is sd { fGex hee RE OG os 2 ONag Aha co OM 





and, in this context, the function f; is called the marginal density of the 
ith random variable. Integrating out only some of the k random variables, 
we can similarly define joint marginal densities of any two, three, or more 
of the k random variables. 


Continuation of Example. For instance, in our example the marginal 
density of the first random variable is given by 


ee \, Ge 2a St gx, for x, >0 
1 iy 


elsewhere 
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or 
26 for x; = 0 
fi(%1) = 
0 elsewhere 
To explain what we mean by the independence of continuous random 
variables, we could proceed as in Section 2.5 and define conditional densities 
first; however, it will be easier to say that 


k random variables are independent if and only if 
F(x,, Nay +205 Xx) — Fy (X1)+F,(X2)- cee °F (X;) 


for all values of these random variables for which the respective 
functions are defined. 


In this notation, F(x,,x.,...,X,) is, as before, a value of the joint 
distribution function of the k random variables, while F,(x;) for i= 
1, 2,..., are the corresponding values of the individual distribution 
functions of the respective random variables. 


Continuation of Example. In our example we have 


ree (1 — e~**1)(1 — e73*) for x, >Oand x, >0 
ee 0 elsewhere 
1 — e7**: for x, >0 
F(x ,) = 
0 elswehere 
| = e-o™ for X53. > 0 
P(X.) = 
a(%s) fo elsewhere 


and it can be seen that the two random variables of our example are 
independent. 


Analogous to the special multiplication law for independent events and 
its extension on page 36, it follows immediately from our definition that 
if k random variables are independent, any value of their joint probability 
density equals the product of the corresponding values of the marginal densi- 
ties of the k random variables; symbolically, 


bit cre. cheers. = fi(%1)-fa(%2)> «2. A(X) 
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Given two random variables with values x, and x,, we shall define the 
conditional probability density of the first given that the second takes on 
the value x, as 





g1(x1 | x2) ~ Dee fr(X2) #0 





where f(x,, X,) and f,(x,) are, as before, values of the joint density of the 
two random variables and the marginal density of the second. Note that 
this definition parallels that of conditional probabilities on page 34. 


Example. If two random variables have the joint probability density 


$(x, + 2x2) for 0 — x, = 1,04 % <1 
0 elsewhere 


fatar Xo) a 
the marginal density of the second is given by 
1 
fal%2) = | 3Or + 2x0) de = 4+ 4x.) for0 <x, <1 
0 


and f,(x,) = 0 elsewhere. Hence, the conditional density of the first 
given that the second takes on the value x, is given by 


2(x, + 2x,) 2x, + 4x, 


We Nag ee ag, eee 
21 ( 1 |X) I + 4x) nar oo 


for 0-= x. =< I 
and g,(x,|x,) = 0 elsewhere. We should add that this conditional 
density holds only for 0 < x, < I. 


The following is another concept which is of importance when deal- 
ing with k random variables. If a random variable takes on the value 
g(X1, Xo,.++5X,) Whenever k random variables take on the values 
X1, Xo,---» Xz, then the expected, or average value of the first random 
variable (namely, the mean of its distribution) is given by 





[fice fh less es eM Gere ae) des de dee 





where f(x,,...,X,) is, as before, a value of the joint density of the k 
random variables. 
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Example. For instance, referring to the joint density of the example 
on page 126, we find that the expected value of the product of the two 
random variables is 


| | ibe dx, dx, =1 
0 0 


EXERCISES 4,42 
‘a 






. If two random variables have the joint density 


Xixs for 0 = 4, < 1,0 < xe <2 
f(%1, X2) = 
0 elsewhere 
find 
(a) the probability that both random variables will taken on values less 
than 1; 


(b) the probability that the sum of the values taken on by the two random 
variables will be less than 1; 
i «$3 (c) the joint distribution function; @) 
7d) both marginal densities. . 
rt Also check whether the two random variables are independent. UL ) 


7 +P If two random variables have the joint density 
L} 


S(x + y?) forO<x<1,0<y<I1 
0 elsewhere 


fs») = | 


find 
(a) the probability that 0.2< x <0.5and04<y<0.6; 
(b) the joint distribution function; 
Y.760 both marginal densities; - oo 
‘(d) the probability that y < 0.5, using the result of (c); 40 * 7 OV 
, Ze) an expression for f,(x| y) for0 < y <1; 
(f) an expression for /,(x|4) and the mean of the conditional density of the 
{ first random variable when the second takes on the value y = 4. 





With reference to the example on page 129, find 
(a) an expression for the conditional density of the first random variable 
when the second takes on the value x, = 0.25; 
(b) an expression for the conditional density of the second random variable 
/ when the first takes on the value x,. 


4. If three random variables have the joint density 


k(x + ye? fr Oe x<— 10H pe ae= 0 
0 elsewhere 


$0592) = | 
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(a) find k; 

(b) find the probability that x < yandz> 1; 

(c) check whether the three random variables are independent; 

(d) check whether any two of the three random variables are pairwise inde- 
pendent. 


5, A pair of random variables has the “circular normal distribution” if their 
joint density is given by 


f(X1, X2) = 


em eal (x1 mi)? + (x2 H2)?] 





1 
2102 


for —co <x; < wand —w~ <x,<o~. 

(a) If uw; =2 and yw, = —2, and o = 10, use Table 3 to find the aia 
ability that —8 <x; < eh and —9 < x, <3. 

(b) If 4; = Uz = Oando = 3, find the prObauuity that (x1, X2) is contained 
in the region between the two circles x? + x} = 9 and xj + ay == 36. 


- 6. A bomb aimed at a point target has a lethal radius of 500 feet. Using the 
target as the origin of a rectangular system of coordinates, assume that the 
coordinates (x, y) of the point of impact are values of a pair of random vari- 
ables having the circular normal distribution (see Exercise 5) with Ld, = L2 
— 0 and o = 300 feet. What is the probability that the target will be 


F aeias 
. Referring to the random variables of Exercise 1, find the expected value of 
the random variable whose values are given by g(x;, x2) = %1 + X2. 


8. Referring to the random variables of Exercise 2, find the expected value of 
the random variable whose values are given by g(x, y) = x?y. 


. If measurements of the length and the width of a rectangle have the joint 
density 
aS pris] 2227542 pint ey a? 
F(x, y) — dab 2 2 ° ps Z 
0 


elsewhere 


find the mean and the variance of the corresponding distribution of the area 


f the rectangle 
sabi a relationship between 2,;(x1|X2), 22(X2|%1), fi(%1), and fo(x2). 


4.11 SIMULATION 


To simulate the observation of continuous random variables with the 
use of random numbers, we refer to their cumulative distribution functions, 
given either graphically or as tables. (The theory on which this method 
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is based involves the so-called probability integral transformation, which 
is treated in all of the textbooks on mathematical statistics referred to 
in the Bibliography at the end of the book.) 

To illustrate the graphical method, suppose we want to observe the 
values of a random variable having a normal distribution with a given 
mean and standard deviation. Taking two- or three-digit random num- 
bers (depending on the accuracy that can be attained) and treating them 
as decimal fractions by putting a decimal point to the left of the first digit, 
we refer to a graph of the cumulative distribution function of the standard 
normal distribution (see, for instance, Figure 4.9), mark the decimal 
fractions corresponding to the random digits on the F(z) scale, and then 
read the corresponding values of a random variable having the standard 
normal distribution off the horizontal scale. Finally, we obtained the 
simulated values of the original random variable by means of the equa- 
~— 


= E (see page 106) or x = w+ oz. 


tion Zz = 








Figure 4.9. Distribution function of standard normal distribution. 


Example. Suppose we want to simulate observations of a random 
variable having the normal distribution with w = 18.3 and o = 20. 
Using the 3rd and 4th columns of the first page of Table 7, starting 
with the 36th row, we obtain the two-digit random numbers 74, 19, 
78, 82, 06, 11, .... Treating the first of these as the decimal fraction 
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0.74 and marking it on the vertical scale of Figure 4.9, we find that 
the corresponding value of z is 0.64 and, hence, that the simulated 
value of the original random variable is 


x = 18.3 + 20(0.64) = 31.1 


Similarly, treating the random number 19 as a decimal fraction and 
marking 0.19 on the vertical scale of Figure 4.9, we find that the 
corresponding value of z is —0.88 and, hence, that the simulated 
value of the original random variable is 


x = 18.3 + 20(—0.88) = 0.7 


In Exercise 1 on page 134 the reader will be asked to verify that the 
next four random digits, namely, 78, 82, 06, and 11 yield x = 33.7, 
x = 36.7, x = —12.7, and x = —6.3. 


The graphical method we have described has the advantage that it 
can be used to simulate any continuous distribution, but it is generally 
very tedious as it requires that we carefully draw the graph of the cumula- 
tive distribution function. This can be avoided if a suitable table of 
cumulative probabilities is available. For instance, to simulate the obser- 
vation of values of a random variable having the standard normal dis- 
tribution (and, hence, any normal distribution), we refer to Table 3 and 
proceed as follows: given a two- or three-digit random number, we 
change it, as before, into a random decimal between 0 and 1. Then, if the 
random decimal is greater than or equal to 0.5000, we simply look for the 
value of z for which the entry in Table 3 is closest to the random decimal; 
if the random decimal is less than 0.5000, we look for the value of z for 
which the entry in Table 3 is closest to 1 minus the random decimal and 
record —z as the simulated observation. 


Continuation of Example. Let us use this method to rework the 
example on page 132, where we simulated observations of a random 
variable having the normal distribution with w = 18.3 and o = 20, 
using the two-digit random numbers 74, 19, 78, 82, 06, 11,.... Check- 
ing 0.7400 in Table 3, we find that the closest entry corresponds to 
z— 0.64, and, proceeding as before, we get the simulated obser- 
vation 


x= p+ oz=18.3 + 20(0.64) = 31.1 
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Then, checking 1 — 0.1900 = 0.8100 in Table 3, we find that the 
closest entry corresponds to 0.88, and, proceeding as on page 133 
with z = —0.88, we get the simulated observation 


x = 18.3 + 20(—0.88) = —0.7 


EXERCISES 


i Continue the example in the text by using 
(a) the diagram of Figure 4.9, 
(b) Table 3, 
to find the simulated values of the random variable which correspond to the 
random numbers 78, 82, 06, and 11. 


2. Suppose that the number of hours it takes a person to learn how to operate a 
certain machine is a random variable having a normal distribution with 
f= 5.8 and o = 1.2. Simulate the length of time it takes eight persons to 
learn how to operate the machine using 
‘a) the diagram of Figure 4.9; 

(b) Table 3. 


. Suppose that the increase in the pulse rate of a person performing a certain 
task is a random variable whose distribution can be approximated closely 
with a normal distribution having the mean 28.6 and the standard deviation 
5.15. Simulate the increase in the pulse rate of 20 persons performing this task, 
using Table 3 and rounding the answers to the nearest positive integer. 


4. Suppose that the durability of a paint (in years) is a random variable having 

an exponential density (see page 119) with the mean PB = 2. 

(a) Draw a graph of the distribution function of this random variable. 

(b) Use the graph obtained in (a) to simulate an experiment in which the 
paint is applied to 12 houses and it is observed in each case how long the 
paint lasts (that is, how long it takes until there are certain agreed-upon 
signs of deterioration). 


a 


Suppose that the occurrence of breakdowns of a radar tracking system is a 
Poisson process with A = 0.02 per hour, and the time required to repair any 
breakdown is fixed at 10 hours. Simulate 1,000 hours of operation of this SYS- 
tem and estimate the proportion of “down time.” [Hint: Simulate the waiting 
times between successive breakdowns using an appropriate exponential dis- 
tribution.] 


6. Since many simulation experiments entail sampling from normal populations, 
statisticians have constructed special tables of so-called random normal devi- 
ates (see Bibliography at the end of the book), which are looked upon as 
values of a random variable having the standard normal distribution. 
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(a) Using the method directly based on Table 3, construct a table of 100 such 
random normal deviates. 

(b) Use the table obtained in (a) to simulate 20 observations of a random 
variable having a normal distribution with uw = 14.25 ando = 2.16. 


{ Suppose that the annual number of collision-insurance claims in a given 
/~ automobile insurance policy class has the log-normal distribution with 
& = 1.5and B = 0.6. Simulate 10 years of experience with this policy class. 








Treatment 
of Data 





.1 FREQUENCY DISTRIBUTIONS 





Statistical data, obtained from surveys, experiments, or any series of mea- 
surements, are often so numerous that they are virtually useless unless 
they are condensed, or reduced, into a more suitable form. Thus, the first 
step of a statistical analysis often consists of the construction of a fre- 
quency table, that is, a table which divides the data into a relatively small 
number of classes (categories), listing the number of observations belong- 
ing to each class. A frequency table, or a frequency distribution as it is also 
called, sacrifices some of the information contained in a set of data; 
instead of knowing the exact value of each item, we only know that it 
belongs in a certain class. On the other hand, this kind of grouping often 
brings out important features of the data, and the gain in “legibility” of 
the data usually more than compensates for the loss of information. In 
what follows, we shall consider mainly numerical frequency distributions, 
that is, frequency distributions where the data are grouped according to 
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their numerical size. If the data are grouped according to some quality, or 
attribute, we shall refer to such a distribution as a categorical distribution. 

The first step in constructing a frequency table consists of deciding how 
many classes to use and choosing the limits for each class. Generally speak- 
ing, the number of classes we use depends on the number of observations, 
but it is seldom profitable to use fewer than five or more than fifteen 
classes. Among other things, we base this decision on the range of the data, 
that is, the difference between the largest observation and the smallest. 
Then, we tally the observations (much like the tally of votes in an election 
or, for that matter, any kind of count) and, thus, determine the class fre- 
quencies, namely, the number of observations in each class. 


Example. To illustrate the construction of a frequency table, let us 
consider the following 80 determinations of the daily emission of sulfur 
oxides (in tons) of an industrial plant: 


15.8 26.4 17.3 11.2. 23.9 24.8 18.7 13.9 9.0 13.2 
227 98 62 147 175 261 128 286 17.6 23.7 
26.8 22.7 18.0 20.5 11.0 20.9 15.5 19.4 16.7 10.7 
19.1 15.2 22.9 26.6 20.4 21.4 19.2 21.6 16.9 19.0 
185 23.0 246 20.1 16.2 18.0 7.7 13.5 23.5 14.5 
144 29.6 19.4 17.0 20.8 24.3 22.5 246 184 18.1 

83 219 123 22.3 13.3 118: 193° 20.0 23.7 318 
25.9 10.5 15.9 27.5 18.1 17.9 94 24.1 20.1 28.5 


Since the largest observation is 31.8, the smallest is 6.2, and the range 
is 25.6, we might choose the six classes having the limits 5.0-9.9, 10.0- 
14.9, ..., 30.0-34.9, we might choose the seven classes 5.0-8.9, 9.0- 
‘12.9, ..., 29.0-32.9, or we might choose the nine classes 5.0-7.9, 
8.0-10.9, ..., 29.0-31.9. Note that in each case the classes do not over- 
lap, they will accommodate all of the data, and they are all of the same 
size. 

Deciding upon the second of these classifications, we now tally the 
80 observations and obtain the results shown in the following table: 





Class limits Tally Frequency 
5.0— 8.9 // 1 3 
9.0-12.9 TRI THY 10 

13.0-16.9 THI TRE !////1 14 
17.0-20.9 THY TRY PAU THM. PAY 25 
21.0-24.9 PRE PR FRA 7 17 
25,0-28.9 Pres Gili 9 
29.0-32.9 // 2 





Total 80 
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Note that the class limits we used here contain as many decimals as the 
original data. If the original data had been given to two decimals we 
would have used the class limits 5.00-8.99, 9.00-12.99, ..., 29.00- 
32.99, and if they had been rounded to the nearest ton we would have 
used the class limits 5-8, 9-12, ..., 29-32. 


In this example, the actual daily emissions of sulfur oxides can be thought 
of as values of a continuous random variable, but if we used classes such as 
5.0-9.0, 9.0-13.0, ..., 29.0-33.0, we would run into difficulties: 9.0 could 
go into the first class or into the second, and 17.0 could go into the third 
class or into the fourth. To avoid this difficulty, we can let the first class go 
from 4.95 to 8.95, the second from 8.95 to 12.95, ..., and the last from 
28.95 to 32.95. We refer to these new limits as the class boundaries, and it 
should be noted that there will be no ambiguities even though the bound- 
aries overlap. These class boundaries are, so to speak, “impossible” values, 
since the data were given to only one decimal. In practice, we use the class 
boundaries rather than the original class limits chiefly when it is essential 
to stress the fact that the data are values assumed by a continuous random 
variable (or that they are otherwise continuous kinds of measurements). 

As we have pointed out earlier, once data are grouped, each observa- 
tion in a given class has lost its identity in the sense that its exact value is no 
longer known. This leads to some difficulties when we want to give further 
descriptions of the data, but we get around this by representing each 
observation in a class by its midpoint, called the class mark. In general, the 
class marks of a frequency distribution are obtained by averaging succes- 
sive class limits or successive class boundaries. If the classes are all of 
equal length, as in our example, we refer to the common interval between 
any successive class marks as the class interval. Note that the class interval 
can also be obtained from the difference of successive class boundaries, 
but not from the difference of successive class limits. If the classes are not 
all of equal length, we cannot speak of the class interval (of the distribu- 
tion), but we can refer to the difference between the boundaries of a given 
class as its length. 


Continuation of Example. For the distribution of the sulfur dioxide 
emission data, the class marks are 6.95, 10.95, 14.95, 18.95, 22.95, 
26.95, and 30.95, and the class interval is 4. 


There are several alternate forms of distributions into which data are 
sometimes grouped. Foremost among these are the “less than” or “more 
than” cumulative frequency distributions. A cumulative “less than” distri- 
bution shows the total number of observations that are less than given 
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values, and it is easily constructed from the corresponding (ordinary) 
frequency table. We do not know how many observations are less than the 
individual class marks, but we do know how many are less than each class 
boundary or each lower class limit. 


Continuation of Example. Using the class boundaries of the distribu- 
tion of the sulfur oxides emission data, we get 


Tons of Cumulative 

sulfur oxides frequency 
less than 4.95 0 
less than 8.95 3 
less than 12.95 13 
less than 16.95 2d 
less than 20.95 52 
less than 24.95 69 
less than 28.95 78 
less than 32.95 80 


where the cumulative frequencies were obtained by adding the fre- 
quencies, one by one, starting from the top of the frequency table. 
Note that instead of “less than 4.95” we could also have written “less 
than 5.0” or “4.9 or less.” 


Cumulative “more than” distributions are constructed, similarly, by adding 
the frequencies, one by one, starting at the bottom of the frequency table. 
In practice, “less than” cumulative distributions are used more widely, and 
it is not uncommon to refer to cumulative “less than” distributions simply 
as cumulative distributions. 

If it is desired to compare two or more frequency distributions whose 
total frequencies are not equal, it may be necessary (or at least advanta- 
geous) to convert them into so-called percentage distributions. We simply 
divide each class frequency by the total number of observations in the 
distribution and multiply by 100; in this way we indicate what percentage 
of the data falls into each class of a distribution. The same principle can 
also be applied to cumulative distributions, by replacing each cumulative 
frequency by the corresponding cumulative percentage. 


Example. As a further illustration let us construct a frequency distribu- 
tion as well as a percentage distribution for the following data repre- 
senting the number of workers absent froma factory on 50 working days: 
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Is 3 i 37 10 1@ 2 11 © 2 
8 21 12 11 7 7 9 16 49 18 
3 Il 19 6 15 10 14 10 7 24 

ll 3 6 I 4 6 32 2 12 FY 

29 12 9 19 $8 20 15 S 17 16 


Since these data consist of integers, there will be no ambiguities if we 
use the class limits 0-4, 5-9, 10-14, ..., and we thus obtain the fol- 
lowing frequency and percentage distributions: 


Number of Number of 
absences Frequency absences Percentage 

O- 4 4 O- 4 8 
5— 9 15 5- 9 30 
10-14 16 10-14 32 
15-19 8 15-19 16 
20-24 3 20-24 6 
25 or more 4 25 or more 8 


Note that in these tables we replaced the five classes 25-29, 30-34, 35-39, 
40-44, and 45-49, which would ordinarily have been needed, by a single 
class having no upper limit. This device of using an open class is thus used 
(especially in connection with skewed distributions) to eliminate the 
trouble of having to list a large number of classes that are either empty or 
have very small frequencies. 

Note also that when distributions of data are presented in their final 
form, the tally is practically always omitted, and we show only the class 
limits, the class boundaries, or the class marks, and, of course, the corre- 
sponding class frequencies, percentages, cumulative frequencies, or cumul- 
ative percentages. 


vA GRAPHS OF FREQUENCY DISTRIBUTIONS 


Many important properties of frequency distributions, such as their 
symmetry or skewness, the number of their modes (maxima), and so on, 
are best exhibited by means of graphs. In this section we shall introduce 
some of the most widely used forms of graphical presentation of frequency 
distributions, percentage distributions, and cumulative distributions. 

Referring to the frequency table of the sulfur oxides emission data on 
page 137, we might present the information it contains graphically by first 
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plotting the points (x,, f;), where x; is the class mark of the ith class and f; 
is the corresponding frequency, and then drawing straight-line segments to 
connect successive points. The resulting frequency polygon is shown in 
Figure 5.1; it should be noted that we added classes with zero frequencies 
at both ends of the distribution in order to “tie down” the graph to the 
horizontal axis. It is apparent from Figure 5.1 that the distribution of the 
sulfur oxides emission data is nearly symmetrical and bell-shaped, some- 
what resembling the graph of the normal distribution shown in Figure 4.2. 





Figure 5.1. Frequency polygon. 


An alternate way of presenting a frequency distribution in graphical 
form is the histogram, first mentioned on page 53 in connection with 
probability distributions. The histogram of a frequency distribution is 
constructed like that of a probability distribution; the heights of the rec- 
tangles represent the class frequencies and the bases of the rectangles 
extend between successive class boundaries. A histogram of the sulfur 
oxides emission data is shown in Figure 5.2. 

It is sometimes preferable to look upon the areas of the rectangles, 
rather than their heights, as representing the class frequencies of a histo- 
gram. This applies in particular to situations where we wish to approximate 
histograms with smooth curves or where there are classes of unequal 
length (see Exercise 17 on page 149). 
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Figure 5.2. Histogram. 


Inspection of the graph of a frequency distribution often brings out 
features that are not immediately apparent from the data themselves. 
Aside from the fact that such a graph presents a good overall picture 
of the data, it can also emphasize irregularities and unusual features. 
For instance, outlying observations which somehow do not fit the overall 
picture, that is, the overall pattern of the data, may be due to errors of mea- 
surement, equipment failure, and similar causes. Also, the fact that a histo- 
gram or a frequency polygon exhibits two or more modes (maxima) can 
provide pertinent information. The appearance of two modes may imply, 
for example, a shift in the process that is being measured, or it may imply 
that the data come from several sources. With some experience one learns 
to spot such irregularities or anomalies, and an experienced engineer 
would find it just as surprising if the histogram of a distribution of inte- 
grated-circuit failure times were symmetrical as if a distribution of 
American men’s hat sizes were bimodal. 

Cumulative distributions are usually represented graphically by means 
of ogives. An ogive is similar to a frequency polygon, except that we plot 
the cumulative frequencies at the class boundaries instead of the ordinary 
frequencies at the class marks. The resulting points are again connected by 
means of straight-line segments, as shown in Figure 5.3, which represents 
the cumulative “less than” distribution of the sulfur oxides emission data. 
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Figure 5.3. Ogive. 


Ogives are nearly always S-shaped, with the relative size of the two 
tails of the S determined by the symmetry or lack of symmetry of the dis- 
tribution. If a distribution follows closely the pattern of a normal curve, it 
is possible to “straighten out” the S by using a special vertical scale, called 
a probability scale. This scale is so designed that any cumulative normal 
distribution will graph as a straight line. Special graph paper, called arith- 
metic probability graph paper, is commercially available; it has one arith- 
metic (ordinary) scale and one probability scale (see Figure 5.4). 


Example. To illustrate, the cumulative “less than” percentage distribu- 
tion of the sulfur oxides emission data has been plotted on probability 
graph paper in Figure 5.4. Note that the cumulative percentages are 
plotted at the corresponding class boundaries. As can be seen from this 
diagram, the points all lie close to the dashed line, and we can say that 
the data are approximately normally distributed. 


To check for “normality,” probability graph paper can also be used 
directly with ungrouped data. In that case we first arrange the observations 


ae | 
according to size, and if there are n observations we plot 100 — or 
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Figure 5.4. Normal probability graph — grouped data. 


(1007 — 50) 
n 
observation (see Exercise 16 on page 148). If is large, it is not necessary 
to plot every point; every fifth or tenth point will generally suffice to check 
whether the distribution of the data follows closely the pattern of a normal 

distribution. 


percent on the vertical scale corresponding to the ith largest 


Continuation of Example. Referring again to the sulfur dioxides emis- 
sion data on page 137 (arranged according to size), a probability graph 
plotted for every tenth observation, and also the smallest observation, 
is shown in Figure 5.5. Again, it is apparent that the points fall close to 

-a Straight line and, hence, that the data are approximately normally 
distributed. | 


In view of the symmetry of the normal distribution, the mean of the 
normal distribution which approximates a given distribution can be obtain- 
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\ Figure 5.5. Normal probability graph — ungrouped data. 


ed by reading off the horizontal scale the value on the line (fitted to the 
points) which corresponds to 50 percent on the vertical scale. Similarly, 
the standard deviation of the distribution can be approximated by taking 
the difference between the values on the horizontal scale which correspond 
to 84 percent and 50 percent on the vertical scale (or half the difference of 
‘the values which correspond to 84 percent and 16 percent on the vertical 
scale). 


—e————E—~—EEy - 


| . Continuation of Example. Thus, it can be seen from Figures 5.4 or 5.5 

that the mean of the normal distribution which approximates the dis- 
: tribution of the sulfur oxides emission data is about 18.5, and that its 
| standard deviation is about 24 — 18.5 = 5.5. 
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It must be understood that the use of probability graph paper, which 
should really be called normal probability graph paper, is only an approxi- 
mate (and highly subjective) device for checking whether a distribution 
follows the pattern of a normal curve. Only large and obvious departures 
from linearity in such a graph are real evidence that the data do not follow 
the pattern of a normal curve. 

Sometimes other special scales are used to graph cumulative distribu- 
tions. For example, if it is suspected that a set of data follows the pattern 
of a log-normal distribution, probability graph paper with a logarithmic 
scale can be used. Such paper is also commercially available. Various | 
other scales have been devised for checking whether data follow the pat- 
terns of corresponding theoretical distributions, 
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VW, 

EE scurmans of the ignition temperature of a gas vary from 1,161 to 1,319 
degrees Fahrenheit. Construct a table with eight equal classes into which 
these data might be grouped. Give 
(a) the class limits; (b) the class marks; 

(c) the class boundaries; (d) the class interval. 


bd 2. The weights of certain mineral specimens, given to the nearest tenth of an 
ounce, are grouped into a table having the classes 10.5-11.4, 11.5-12.4, 
12.5-13.4, and 13.5-14.4 ounces. Find 
(a) the class marks; 
, (b) the class boundaries; 


(c) the class interval. 
, Re 3 The following are measurements of the breaking strength (in ounces) of a 
5 sample of 60 linen threads: 
BXS 15.2 354 21.3 GBA 26.9) 346 29,3) 245 316 
“21.2 0837 CLD-EEDW 327 GID 302 239 23.0264 
CELD 334 CTH 21.9 2.3173 BM 368 29.2) 23.5 
20.6 @9,5) 21.8 375 335 29.6 269 C87 348 18.6 
CSA 341 CLSG9G 22.2 22.7 345 332 340 283 
369 24.6 (BD 24.8 81 25.4) 345 23.6 38:4 24.0 


ZZ Group these measurements into a distribution having the classes 15.0-19.9, 
j 20.0-24.9, ..., 35.0-39.9 and construct a histogram. 


be (A. Convert the distribution of Exercise 3 into a cumulative “less than” distribu- 
tion and graph its ogive. 


5. Convert the cumulative distribution of Exercise 4 into a cumulative percen- 
tage distribution and, plotting its graph on arithmetic probability paper, 
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check whether it is reasonable to approximate the distribution of the original 
data with a normal distribution. 


s ya The following are the ignition times of certain upholstery materials exposed 
to a flame, given to the nearest hundredth of a second: 


2.58 2.51 4.04 6.43 1.58 4.32 2.20 4.19 
4.79 6.20 1.52 1.38 3.87 4.54 5.12 5.15 
5.50 5.92 4.56 2.46 6.90 1.47 2.11 2.32 
6.75 5.84 8.80 7.40 4.72 3.62 2.46 8.75 
2.65 7.86 4.71 6.25 9.45 12.80 1.42 1.92 
7.60 8.79 5.92 9.65 5.09 4.11 6.37 5.40 
11.25 3.90 5.33 8.64 7.41 7.95 10.60 3.81 
3.78 3.75 3.10 6.43 1.70 640 3.24 1.79 
4.90 3.49 6.77 5.62 9.70 5.11 4.50 2.50 
S21 1.76 9.20. 1.20 6.85. 2.80 7.35 11.75 


Group these figures into a table with a suitable number of equal classes and 
construct a frequency polygon. 


. Convert the distribution of Exercise 6 into a cumulative “less than” percen- 
tage distribution and plot its ogive. | 


8. Plot the cumulative percentage distribution of Exercise 7 on arithmetic as 
well as logarithmic probability graph paper. What do these graphs suggest 
about the form of the distribution? (If logarithmic probability paper is not 
available, plot the logarithms of the class boundaries on arithmetic prob- 
ability paper.) 


f 
yo. The following are 15 measurements of the boiling point of a silicon com- 
pound (in degrees Centigrade): 166, 141, 136, 153, 170, 162, 155, 146, 183, 
157, 148, 132, 160, 175, and 150. Plot the data directly on arithmetic prob- 
ability paper and indicate whether they follow the pattern of a normal dis- 
tribution. 


S uf, 3 10. In a two-week study of the productivity of workers, the following data were 
obtained on the total number of acceptable pieces which 100 workers pro- 


duced: 
Ep) 36 Q & 1°56 2 @ @ és 
31 8) BY 72 55 6D-22 


nS @ 51 5 39 Sn wy 
5 @ 76 74” 70 51. Ay BP 56 
6) 52 3 32RD 64) 53.44 34 


DO 58 51 54 43) 44 35 51 


33. GD) 77 @ 3 
a 34 6D O52 @ 32 
g 55° @).73.50 53 59 54 


74 62) 58°26 35 @ 50 38 7 
Group these data into a suitable frequency table and plot its histogram. 


fiaan 10-39, 30-39, YO-¥T ~~ gO - SY 
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£1] 11. 
12. 


13. 


14. 


15. 


16. 


Convert the distribution obtained in Exercise 10 into a cumulative “less than” 
percentage distribution and plot its ogive. 


Plot the cumulative percentage distribution of Exercise 11 on arithmetic 
probability paper and interpret the result. 


The following are the number of automobile accidents which occurred at 60 
major intersections in a certain city during the Fourth of July weekend: 


025 014102 1 
> OL S&S UB 2 ft 2d 
1402412 4 0 4 
3 5 0 1 3 6 420 2 
OQ 2 Be A 2 SS 2b I -2 
A A. i Oe Be a 


(a) Group these data into a frequency distribution showing how often each 
of the numbers occurred. 
(b) Construct a cumulative “or more” distribution and draw its ogive. 


Categorical frequency distributions are often represented graphically by 
means of pie charts in which a circle is divided into sectors proportional in 
size to the frequencies with which the data are distributed among the cate- 
gories. Construct a pie chart to represent the following distribution: 


Number of B.A. degrees 
in engineering conferred 
in the U.S. in 1970-71 


Aeronautical engineering 2,443 
Chemical engineering 3,579 
Civil engineering 6,526 
Electrical engineering 12,198 
Mechanical engineering 8,858 
Industrial engineering 3.171 
General engineering 2,864 


The pictogram of Figure 5.6 is intended to illustrate the fact that in a certain 
area, average family income has doubled from $5,000 in 1955 to $10,000 in 
1970. Does this pictogram convey a “fair” impression of the actual increase 
in family income? If not, state how it should be modified. 


Given a set of observations x,, x2,..., and x,, we define their empirical 

cumulative distribution as the function whose values F(x) equal the proportion 

of the observations less than or equal to x. 

(a) Graph the empirical cumulative distribution for the 15 measurements of 
Exercise 9. 
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(b) Verify that the method of curve fitting used on page 144 (the check for 
“normality” for ungrouped data) amounts to plotting points correspond- 
ing to the midpoints of the steps of the empirical cumulative distribution. 





Figure 5.6. Pictogram for Exercise 15. 


{ 17. Convert the distribution of the sulfur oxides emission data on page 137 into 

a distribution having the classes 5.0-8.9, 9.0-20.9, 21.0-28.9, and 29.0-32.9. 
Draw two histograms of this distribution, one in which the class frequencies 
are given by the heights of the rectangles and one in which the class fre- 
quencies are given by the areas of the rectangles. Explain why the first of 
these histograms gives a very misleading picture. 


: 5.3. DESCRIPTIVE MEASURES 


In Section 3.4 we introduced the mean and the variance of a probability 

distribution as parameters which measure, respectively, its center and its 

| spread. In this section we shall define corresponding measures to describe 
\ a set of data or its distribution. 

Given a set of m measurements or observations, x,, X,,..., X,, there 
are several ways in which we can describe their center (middle, or central 
location). Foremost among these are the arithmetic mean and the median, 
although other kinds of “averages” are sometimes used for special pur- 
poses. The arithmetic mean or, more succinctly, the mean is defined by the 
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formula 





Note that we have written the mean of the x’s as x and not as yw, the sym- 
bol we used for the mean of a probability distribution (or probability 
density). In this connection, we shall follow the general practice of using 
Latin letters fo denote descriptions of actual data and Greek letters to 
denote descriptions of theoretical distributions. It is also of interest to 
note that the above formula would yield the mean of the distribution of a 
random variable which takes on the values x; with equal probabilities of 
] 
a 
Sometimes it is preferable to use the median as a descriptive measure 
of the center, or location, of a set of data. This is true, particularly, if it is 
desired to reduce all calculations to a minimum or if it is desired to elimi- 
nate the effect of extreme (very large or very small) values. The median of 
n observations x,, X,,..., X, can be defined loosely as the “middlemost” 
value once the data are arranged according to size. More precisely, if the 
observations are arranged according to size and n is an odd number, the 
n+], 
ane 
number, the median is defined as the mean (average) of the observations 


n Sas 
numbered > and 5 





median is the value of the observation numbered if nm is an even 





Example. For instance, the median of the five observations 15, 14, 2, 
27, 13 is 14 (the value of the third largest observation), and the median 
of the six observations 17, 9, 15, 19, 4, 16 is 15.5 (the mean of the third 
and fourth largest observations). 


Although the mean and the median each provides a single number to 
represent an entire set of data, the mean is usually preferred in problems 
of estimation and other problems of statistical inference. An intuitive 
reason for preferring the mean is that the median does not utilize all the 
information contained in the observations. A related reason is that the 
median is generally subject to greater chance fluctuations, that is, it is apt 
to vary more from sample to sample. This important concept of “sampling 
variability” will be explored in detail in Chapter 6. 
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Example. To give an example where the median actually gives a better 
description than the mean, suppose an employer claims that the mean 
salary paid to engineers in his firm is $24,000. This gives the impression 
that this firm is a good place to work. However on further examination 
it turns out that it is a small company which employs four young engi- 
neers at $12,000 each, plus the owner whose income is $72,000. Thus, 
the income distribution is highly skewed; its mean of $24,000 repre- 
sents nothing useful, while the median income of $12,000 is at least 
representative of what a young engineer can expect to earn with this 
firm. 


Like the variance of a probability distribution, the variance of n obser- 
vations X,, X,,..., X, measures essentially the average of their squared 
deviations from their mean, x, and it is defined by the formula 


n 

: dy (%; — Xx)? 

SS 
n— | 


There are several reasons for using the divisor n — 1 instead of n. First, 
only n — 1 of the deviations from the mean, x, — x, are independent, 
since their sum is always equal to zero (see Exercise 1 on page 156). In 
other words, n — 1 of the deviations from the mean automatically deter- 
mine the mth. Another reason is that if we look upon the x’s as values 
assumed by some random variable, division by nm — | in the formula for s 
makes this variance a better estimate of 0”, the variance of the distribution 
of this random variable. We shall say more about this in Chapters 7 and 8. 

Consistent with the terminology of Chapters 3 and 4, we define the 
standard deviation of n observations x;, X,,..., X, aS the square root of 
their variance, namely, as 


Gia 


_— n— | 


Note that if we had divided by v instead of by n — 1, the resulting formula 
could have been used for the standard deviation of the distribution of a 
random variable which assumes the values x; with equal probabilities of 
1/n. 
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The standard deviation and the variance are measures of absolute vari- 
ation, that is, they measure the actual amount of variation present in a set 
of data, and they are dependent on the scale of measurement. To compare 
the variation in several sets of data, it is generally desirable to use mea- 
sures of relative variation; for this purpose we use the so-called coefficient 
of variation 


qd 
Ss 
| 
ei] & 
S 
i 


Note that this measure, which gives the standard deviation as a percentage 
of the mean, is independent of the scale of measurement. 


Example. If a set of data has the mean x = 12.0 and the standard 
deviation s = 1.5, the coefficient of variation is CV = 12.5 percent; in 
other words, the standard deviation 1s 12.5 percent of the mean. 


In this section, we have limited the discussion to the mean, the median, 
the variance, and the standard deviation. There are, of course, many other 
ways of describing sets of data. For instance, if we want to determine a 
value below which we find the lowest 25 percent of data, we calculate the 
first quartile O,; if we want to determine a value above which we find the 
highest 5 percent of the data, we calculate the ninety-fifth percentile P,;; 
and so on. For grouped data, the calculation of such fractiles is similar to 
that of the median. There are also alternate ways of describing the vari- 
ability of a set of data. For instance, there is the sample range (the largest 
value minus the smallest, which we shall have the occasion to use is Section 
8.1), the mean deviation, and the interquartile range. As the need arises, 
new ways and new methods of describing statistical data are constantly 
being developed. 


5.4 THE CALCULATION OF X AND § 


In this section we shall discuss methods for calculating x and s for 
raw (ungrouped) as well as grouped data. The methods we shall employ 
are particularly well-suited for small calculators, and they are both rapid 
and accurate. 

The calculation of x for ungrouped data does not pose any problems; 
we have only to add the values of the observations and divide by n. On the 
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other hand, the calculation of s? is usually too cumbersome if we directly 
use the formula on page 151. Instead, we shall use the algebraically equiv- 
alent form 


n n 2 
ne >, x? — (33 x;] 
9? = —/51 f=1 
n(n — 1 


which has the dual advantage of requiring less labor and giving better 
accuracy. (In Exercise 5 on page 157 the reader will be asked to show that 
this formula is equivalent to the one on page 151.) The decrease in labor 
results from the fact that we do not actually have to calculate the devia- 
tions from the mean; the increase in accuracy is due to the reduction of the 
number of divisions and subtractions, and their postponement until the 
last two steps in the calculation. 


Example. To illustrate the calculation of x and s let us find the mean 
and the standard deviation of the following miles per gallon obtained 
in 20 test runs performed with an imported sub-compact car: 


I9.7 21,5 22,5 22.2 22.6 
219 205 195 199 214 
22.8 23.2 21.4 20.8 19.4 
220: 230 -21,1. 20.9 21.5 


Using a calculator, we find that the sum of these figures is 427.7 and 
that the sum of their squares is 9,173.19. Consequently, 


3 22717 _ 4132 
Z 
and 
, _. 20(9,173.19) — (427.7)* __ 
a 0L19 — ].412 


and it follows that s = 1.19. In computing the necessary sums we 
usually retain all decimal places, but as in this example, at the end we 
usually round to one more decimal than we had in the original data. 


The calculations we have illustrated in this example can be made even 
simpler by first coding the observations; that is, by performing a suitable 
linear transformation. 
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Continuation of Example. If we multiply each of the mileages by 10 to 
eliminate decimal points and then (arbitrarily) subtract 200 to make the 
numbers easier to work with, we now have the coded values 


—3° 15 :25 22 26 
9 3s=-7 =| 
28 32 14 8 —6 
20 30 ll 9 13 


which we shall refer to as u’s. Calculating the mean and the standard 
deviation of the u’s by the method that we used before, we obtain 277 
for the sum of the u’s, 6,519 for the sum of their squares, and, hence, 


w= 13.8 and s,=— 11.9 


In order to undo the coding, let us observe that the x’s and the u’s are 
related by the equation 


;=10x,— 200 or x,=0.1lu, + 20 


Continuing from here, simple algebra will show that x = 0.1a + 20 
and s, = 0.ls,, where s, and s, are, respectively, the standard devia- 
tions of the x’s and the u’s. Thus, we get 


x = 0,103.8) + 20 = 21.38 and 5, = 0.1(11.9) = 1.19 


which agrees with the values previously obtained. 


In Exercise 6 on page 157, the reader will be asked to verify that, in gen- 
eral, when data are coded so that 


X,=cu, ta 


the corresponding formulas for the mean and the standard deviation are 


X=cu+ta and s,=c:s, 


To calculate x and s from grouped data, we shall need to make some 


assumption about the distribution of the values within each class. If we 
represent all values within a class by the corresponding class mark, the 
sum of the x’s and the sum of their squares can now be written 


k k 
Sah and Sere 
i=1 t=1 
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where x; is the class mark of the ith class, f, is the corresponding class 
frequency, and k is the number of classes in the distribution. Substituting 
these sums into the computing formulas for x and s*, we obtain 





Example. To illustrate, let us return to the frequency table of the sulfur 
oxides emission data on page 137. Recording the class marks x; in the 
first column, the class frequencies f; in the second column, and the 
products x,f; and x?f, in the third and fourth columns, we obtain 





Xi Si xifi xP; 
6.95 3 20.85 144.9075 
10.95 10 109.50 1,199.0250 
14.95 14 209.30 3,129.0350 
18.95 25 473.75 8,977.5625 
22.95 17 390.15 8,953.9425 
26.95 9 2AZ aS 6.556.1225 
30.95 Zz 61.90 1,915.8050 
80 1,508.00 30,857.0000 


Substituting n = 80 and the necessary sums into the formulas for x and 
s, we obtain 


1,508 


aS 18.85 


x= 





and 
,  80(30,857) — (1,508)? _ 


These calculations were very tedious, and we went through them mainly 
to impress the reader with the simplifications that can be attained by 
coding, in this case coding the class marks. The coding we use in connection 
with distributions is to represent the class marks with successive integers, 
preferably with 0 near the center of the distribution or near the class which 
has the highest frequency. 
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Continuation of Example. Coding the class marks of our distribution 
as —3, —2, —1, 0, 1, 2, 3 and referring to these coded class marks as 
u’s, we find that the above computing table reduced to 


U; fi urf; uP fi 
—3 3 —9 Zs 
—2 10 —20 40 
—1 14 —14 14 

0 25 0 0 

1 17 17 17 

2 9 18 36 

3 Z 6 18 

80 —2 152 


and the mean and the variance of the w’s are 


<a 
= yy = —0.025 


21 


and 
2 _. 80(152) — (—2)? _ 
= BH.FQ 0 1,923 


u 


Since the coding we used is such that x, = 4u, + 18.95, the two for- 
mulas on page 154 yield x = 4(—0.025) + 18.95 = 18.85 and s? = 
42(1.923) = 30.77, which agree with the results previously obtained. 


In general, if c is the class interval of a distribution and x, is the class mark 
to which we assign 0 in the new scale, the coding is given by x, = c-u, + x, 
and, hence, the formulas for the mean and the standard deviation are 


X¥=cu+x, and s,=c's, 





ae EXERCISES 


\} Show that >> (x; — x) = 0. 
i=1 


} The following are the number of twists that were required to break 12 forged 
g alloy bars: 33, 24, 39, 48, 26, 35, 38, 54, 23, 34, 29, and 37. Find 
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(a) the mean; 

(b) the median; 

(c) s using the formula on page 151; 

(d) s using the computing formula on page 153. 


“The following are the I.Q.’s of 15 persons selected for jury duty by a court: 
108, 97, 89, 111, 127, 103, 92, 88, 110, 94, 118, 96, 103, 107, and 112. 
(a) Find the mean and comment on a lawyer’s claim that “the average juror 
has an J.Q. over 100.” 
(b) Find s and determine the percentage of these I.Q.’s that are within one 
standard deviation on either side of the mean. 


« 74. The following are 12 temperature readings taken at various locations in a 
3 large kiln (in degrees Fahrenheit): 475, 500, 460, 425, 460, 410, 470, 475, 460, 
510, 450, 415. 
(a) Find the mean. 
(b) Find the variance. 
Pe Find the coefficient of variation. 


Show that the computing formula for s2 on page 153 is equivalent to the one 
used to define it on page 151. 


6. If data are coded so that x; = cu; + a (see page 154), show that 


X=c uta and s,=c:'S, 


i Rework Exercise 4 by using the coding u = = — 90 and compare the 


results. 


8. Calculate x and s for the data of Exercise 9 on page 147 
(a) without coding; 
(b) after suitably coding the 15 measurements. 


9, Referring to the example on page 139, find the mean and the standard devia- 
tion of the data on absenteeism 
(a) by using the raw (ungrouped) data; 
(b) by using the frequency distribution obtained on page 140. 


10. Find the mean and the variance of the accident data of Exercise 13 on page 
148 
(a) by using the ungrouped data; 
(b) by using the frequency distribution obtained in that exercise. 


11. Use the distribution obtained in Exercise 3 on page 146 to find the mean and 
the standard deviation of the breaking strengths. 


12. Use the distribution obtained in Exercise 6 on page 147 to find the mean and 
the variance of the ignition times. Also determine the coefficient of variation. 


13. To find the median of a distribution obtained for n observations, we first 
determine the class into which the median must fall. Then, if there are j 
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15. 


values in this class and k values below it, the median is located ene) # 


of the way into this class, and to obtain the median we multiply this fraction 
by the class interval and add the result to the lower class boundary of the 
class into which the median must fall. This method is based on the assump- 
tion that the observations in each class are “spread uniformly” throughout 
n 


7 of the observations instead of 


the class interval, and this is why we count 


n+ 1 
2 
sulfur oxides emission data on page 139. Since n = 80, it can be seen that 
the median must fall into the class 17.0-20.95, and since 7 = 25 and k = 27, 

40 — 27 

er -4 = 19.03. | 

(a) Find the median of the distribution of the absenteeism data obtained on 
page 140. 

(b) Find the median of the distribution of breaking strengths obtained in 
Exercise 3 on page 146. 

(c) Find the median of the distribution of ignition times obtained in Exer- 
cise 6 on page 147. 





as on page 150. To illustrate, let us refer to the distribution of the 


it follows that the median is 16.95 + 





. If k sets of data consist, respectively, of 7;,2,...,, observations and have 
the means x1, X2,..., X,, then the overall mean of all the data is given by 
the formula 

k 

Dy 1:X; 
X= = 

OM 


(a) The average annual salaries paid to top-level management in three com- 
panies are $54,000, $62,000, and $59,000. If the respective numbers of 
top-level executives in these companies are 4, 15, and 11, find the average 
salary paid to these 30 executives. 

(b) If the mean weight of the 4 offensive starting backs of a professional 
football team is 214 pounds and the mean weight of the 7 offensive 
starting linemen is 258 pounds, what is the mean weight of this starting 
eleven ? 

(c) Prove this formula for the overall mean of several sets of data. 

The formula of Exercise 14 is a special case of the following formula for 

computing weighted means: 





where w, is a weight indicating the relative importance of the ith observation. 
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(a) If an instructor counts the final examination in a course four times as 
much as each one-hour examination, what is the weighted average grade 
of a student who received grades of 69, 75, 56, and 72 in four one-hour 
examinations and a final examination grade of 78? 

(b) If someone invests $4,000 at 5.5 percent in a savings account, $10,000 at 
7.25 percent in a certificate of deposit, and $18,000 at 9 percent in second 
mortgages, find the average percentage yield of these investments. 





Sampling 
Distributions 


6.1 POPULATIONS AND SAMPLES 


Usage of the term “population” in statistics is a carry-over from the days 
when statistics was applied mainly to sociological and economic phe- 
nomena. Nowadays, it is applied to any set or collection of objects, actual 
or conceptual, and mainly to sets of numbers, measurements, or observa- 
tions. For example, if we are interested in determining the average number 
of television sets per household in the United States, the totality of these 
figures, one for each household, constitutes the population for this study. 
Similarly, the population from which inspectors draw a sample to deter- 
mine some quality characteristic of a manufactured product may be the 
corresponding measurements for all units in a given lot; depending on the 
objectives of the inspection, it may also consist of the corresponding 
measurements for a// units that may conceivably be manufactured. 


160 


161 6.1 Populations and Samples 


In some cases, such as the above example concerning the number of 
television sets per household, the population is finite; in other cases, such 
as the determination of some characteristic of all units, past, present, and 
future, that might conceivably be manufactured by a given process, it is 
convenient to think of the population as infinite. Similarly, we look upon 
the results obtained in a series of flips of a coin as a sample from the hypo- 
thetically infinite population consisting of all conceivably possible flips of 
the coin. 

Populations are often described by the distributions of their values, and 
it is common practice to refer to a population in terms of this distribution. 
(For finite populations, we are referring here to the actual distribution of 
its values; for infinite populations, we are referring to the corresponding 
probability distribution or probability density.) For example, we may 
refer to a number of flips of a coin as a sample from a “binomial popula- 
tion” or to certain measurements as a sample from a“normal population.” 
Hereafter, when referring to a “population f(x)” we shall mean a popula- 
tion such that its elements have a frequency distribution, a probability 
distribution, or a density with values given by f(x). 

If a population is infinite it is impossible to observe all its values, and 
even if it is finite it may be impractical or uneconomical to observe it in its 
entirety. Thus, it is usually necessary to use a sample, a part of a popula- 
tion, and infer from it results pertaining to the entire population. Clearly, 
such results can be useful only if the sample is in some way “representa- 
tive” of the population. It would be unreasonable, for instance, to expect 
useful generalizations about the population of 1974 family incomes in the 
United States on the basis of data pertaining to home owners only. Simi- 
larly, we can hardly expect reasonable generalizations about the perfor- 
mance of a tire if it is tested only on smooth roads. To assure that a sample 
is representative of the population from which it is obtained, and to pro- 
vide a framework for the application of probability theory to problems of 
sampling, we shall confine the use of the term “sample” to so-called random 
samples. For sampling from finite populations, random samples are defined 
as follows: 


A set of observations X,,X2,...5X, constitutes a random sample 
of size n from a finite population of size N, if it is chosen so that each 


subset of n of the N elements of the population has the same prob- 
ability of being selected. 





Note that this definition of randomness pertains essentially to the manner 
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in which the sample values are selected. This holds also for the following 
definition of a random sample from an infinite population: 









A set of observations x,, Xz,..., X, constitutes a random sample of 
size n from the infinite population f(x) if: 


I. Each x; is a value of a random variable whose distribution is given 


by f(x). 


2. These n random variables are independent. 






There are several ways of assuring the selection of a sample that is at 
least approximately random. When dealing with a finite population, we 
can serially number the elements of the population and then select a sam- 
ple with the aid of a table of random digits (see discussion on page 92). 


Example. For instance, if a population nas N = 500 elements and we 
wish to select a random sample of size n = 10, we can use three arbi- 
trarily selected columns of Table 7 to obtain 10 different three-digit 
numbers less than or equal to 500, which will then serve as the serial 
numbers of the elements to be included in the sample. 


When the population size is large, the use of random numbers can 
become very laborious and at times practically impossible. For instance, if 
a sample of five cartons of canned peaches is to be chosen for inspection 
from among the many thousands stored in a warehouse, one can hardly 
expect to number all the cartons, make a selection with the use of random 
numbers, and then pull out the ones that were chosen. In a situation like 
this, one really has very little choice but to make the selection relatively 
haphazard, hoping that this will not seriously violate the assumption of 
randomness which is basic to most statistical theory. 

When dealing with infinite populations, the situation is somewhat dif- 
ferent since we cannot physically number the elements of the population; 
but efforts should be made to approach conditions of randomness by the 
use of artificial devices. For example, in selecting a sample from a produc- 
tion line we may be ‘able to approximate conditions of randomness by 
choosing one unit each half hour; when tossing a coin we can try to flip it 
in such a way that neither side is intentionally favored, and so forth. The 
proper use of artificial or mechanical devices for selecting random samples 
is always preferable to human judgment, as it is extremely difficult to avoid 
unconscious biases when making almost any kind of selection. 
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Even with the careful choice of artificial devices, it is all too easy to 
commit gross errors in the selection of a random sample. To illustrate 
some of these pitfalls, suppose we have the task of selecting logs being fed 
into a sawmill by a constant-speed conveyer belt, for the purpose of obtain- 
ing a random sample of their lengths. One sampling device, which at first 
sight would seem to assure randomness, consists of measuring the logs 
which pass a given point at the end of a certain number of ten-minute 
intervals. However, further thought reveals that this method of selection 
favors the longer logs, since they require more time to pass the given point. 
Thus, the sample is not random since the longer logs have a better chance 
of being included. Another common mistake in selecting a sample 1s that 
of sampling from the wrong population or from a poorly specified popula- 
tion. As we have pointed out earlier, we would hardly get a sample from 
which we could generalize about family incomes in the United States if 
we limited our sample to home owners. Similarly, if we wanted to deter- 
mine the effect of vibrations on a structural member, we should be careful 
to delineate the frequency band of vibrations that is of relevance, and to 
vibrate test specimens only at frequencies selected randomly from this 
band. 

The purpose of most statistical investigations is to generalize from in- 
formation contained in random samples about the population from which 
the samples were obtained. In particular, we are usually concerned with 
the problem of making inferences about the parameters of populations, 
such as the mean yw and the standard deviation a. In making such infer- 
ences, we usually use statistics such as X and s, namely, quantities calculat- 
ed on the basis of sample observations. Since the selection of a random 
sample is controlled largely by chance, so are the values we obtain for 
these statistics. The remainder of this chapter will be devoted to a discus- 
sion of sampling distributions—distributions which describe the chance 
fluctuations of statistics calculated on the basis of random samples. 


6.2 THE SAMPLING DISTRIBUTION OF THE 
MEAN (a0 KNOWN) 


Suppose a random sample of n observations has been taken from some 
population and that x has been computed, say, to estimate the mean of the 
population. It should be clear that if we took a second random sample of 
size n from this population, it would be quite unreasonable to expect the 
identical value for x, and if we took several more samples, probably no 
two of the x’s would be alike. The differences among such x’s are generally 
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attributed to chance, and this raises important questions concerning their 
distribution, specifically concerning the extent of their chance fluctuations. 

To approach this question experimentally, let us consider the experi- 
ment described in the following example: 


Example. Suppose that 50 random samples of size n = 10 are to be 
taken from a population having the discrete uniform distribution 
f(x) = jy  forx=0,1,2,...,9 
0 elsewhere 
A convenient way of obtaining these samples is to use a table of random 
digits, like Table 7, letting each sample be a set of 10 consecutive 


digits in arbitrarily chosen rows or columns. The means of 50 samples 
thus obtained are 


44 32 50 35 41 44 36 65 53 4.4 
a) 3.3 38 43 3.3 3.0 - 492 438. 3.1 33 
3.0 30 46 5.8 46 40 3.7 5.2 3.7 3.8 
5.35.55 48 64 49 65 35 45 4.9 5.3 
3.6 2.7 40 5.0 2.6 4.2 44 5.6 4.7 4.3 


and the following is a frequency table showing their distribution: 


x Frequency 
2.0-2.9 Z 
3.0-3.9 14 
4.0-4.9 19 
5.0-5.9 12 
6.0-6.9 3 

50 


It is apparent from the histogram representing the above distribution 
in Figure 6.1, that the distribution of the x’s is fairly bell-shaped, even 
though the population itself has a uniform distribution. This raises the 
question whether this kind of result is typical of what we might expect in 
the long run; in other words, would we get a similar distribution if we took 
100 samples, 1,000 samples, or perhaps even more? To answer this kind of 
question, we shall have to investigate the theoretical sampling distribution 
of x which, for the given example, provides us with the probabilities of 
getting x’s between 2.0 and 2.9, between 3.0 and 3.9,..., between 6.0 and 
6.9, and perhaps values less than 2.0 or greater than 6.9. Although we 


o _— 


165 6.2 The Sampling Distribution of the Mean (o Known) 





Figure 6.1. Experimental sampling distribution of the mean. 


could actually evaluate these probabilities for this example, it is usually 
sufficient to refer to some general theorems concerning sampling distribu- 
tions. The first of these, stated as follows, gives expressions for the mean 
and the variance of sampling distributions of x: 





THEOREM 6.1. Ifa random sample of size n is taken from a popula- 
tion having the mean yw and the variance a*, then x is a value of 
a random variable whose distribution has the mean wu. For samples 
from infinite populations the variance of this distribution is o”/n; 
for samples from finite populations of size N the variance is 
o*7 N—n 

n N—1L 





Writing yw; for the mean of the sampling distribution of x, we shall first 
prove for the continuous case that uw; = uw. (The proof for the discrete 
case follows the identical steps, with integral signs replaced by }’s.) Using 
the definition on page 129, we have 


te = | | | D) fers Xap ees Xp) dey Oita... AR, 


i=1 


=i% | | ma | ek (Lis Mos vg Se Ay ans Oy 
i=1 ey eer — oo 
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where f(x,, X,,..., X,) is a value of the joint density of the random vari- 
ables whose values constitute the random sample. Using the assumption 
of randomness, we can write 


F(X 1y Xay 6 0 Xn) = Si %2)+ «Sf %n) 


and we now have 


FE [sedan [ xfeodu.. [sedan 


f= 


Since each integral except the one with the integrand x, f(x,) equals 1, and 
the one with the integrand x,f(x,) equals yw, we finally obtain 


p= Ly 
n 


i=1 


and this completes the proof of the first part of the theorem. 

To prove that for random samples from infinite populations o2, the 
variance of the sampling distribution of %, equals o?/n, we shall make the 
simplifying assumption that uw = 0. As the reader will be asked to show in 
Exercise 10 on page 172, this does not involve any loss of generality; as a 
matter of fact, we stated a similar result on page 154, which showed that 
addition of a constant to each value does not affect the standard deviation 
(or the variance) of a set of data. Using the definition on page 129, we 
thus have 


Co. = a a Rak {- Be 4s Rie dw og By) he xe 
and making use of the fact that 


e= MSs) Me ape) 


we obtain 


_ i 7. et Ci Nog. s ena ey ey veg 


ef .. [ef Ce eos) diy diy di “a 
ap 


where >/ >; extends over all i andj from 1 to n, not including the terms 
tA 


where / = j. Again using the fact that f(x,, x,,..., x,) = f(x1) f(%2)° 
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- f(x,,), we can write each of the above multiple integrals as a product of 
simple integrals, with each simple integral that has an integrand of the 


form f(x) equalling 1. We thus obtain 


a=50 [. © xB (a) dx, + pap [ xif (xi) dx; | : xf (x)) dx; 


and since each integral in the first sum equals o? while each integral in the 
second sum equals 0, we finally have 


1] 
i= 30? 


This completes the proof of the second part of the theorem. We shall not 
prove the corresponding result for random samples from finite populations, 


but it should be noted that in the resulting formula for o2 the factor a — 7 
often called the correction factor for finite populations, is close to | (and can 
be omitted for most practical purposes) unless the sample constitutes a 


substantial portion of the population. 





Example. For instance, if a random sample of size n = 10 is drawn 
from a population of size N = 1,000, then ent is approximately 


N—1 
0.991, 


Although it is not very surprising that the mean of the theoretical 
sampling distribution of x equals the mean of the population, the fact that 
its variance equals o?/n, for random samples from infinite populations, is 
interesting and important. To point out its implications, let us apply 
Chebyshev’s theorem to the sampling distribution of x, substituting x for 
x anda/,/n foro in the inequality given on page 75. We thus obtain 


2 ko = 
P(iz— ul< 2) 21 ja 





and letting ka/,/n = €, we get 


P(ix—-—pl<egpll- a7 
Thus, for any given € > 0, the probability that x differs from y by less than 


€ can be made arbitrarily close to 1 by choosing n sufficiently large. In less 
rigorous language, the larger the sample size, the closer we can expect x to 
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be to the mean of the population. In this sense we can say that < becomes 
more and more reliable as an estimate of u as the sample size is increased. 
The reliability of * as an estimate of wis often measured by the expression 
o/,/n, also called the standard error of the mean. Note that this measure 
of the reliability of x decreases in proportion to the square root of n; for 
example, it is necessary to quadruple the size of the sample in order to halve 
the standard deviation of the sampling distribution of the mean. This also 
indicates what might be called a “law of diminishing returns” so far as 
increasing the sample size is concerned. Usually it does not pay to take 
excessively large samples since the extra labor and expense is not accom- 
panied by a proportional gain in reliability. For instance, if we increase the 
size of a sample from 25 to 2,500, the errors to which we are exposed are 
reduced only by a factor of 10. 


Example. Let us now return to the experimental sampling distribution 
on page 164, and let us check how closely its mean and variance corre- 
spond to the values we should expect in accordance with Theorem 6.1. 
Since the population from which the 50 samples of size 10 were obtain- 
ed has the mean 


and the variance 
a? = ¥ (x — 4.5/2 = 8.25 
x=0 


Theorem 6.1 leads us to expect a mean of uw, = 4.5 and a variance of 
o% = 8.25/10 = 0.825. Calculating the mean and the variance from 
the frequency table on page 164, we obtain X, = 4.45 and 52 = 0.939, 
which are reasonably close to the theoretical values. 


Theorem 6.1 provides only partial information about the theoretical 
sampling distribution of the mean. In general, it is impossible to determine 
such a distribution exactly without knowledge of the actual form of the 
population, but it is possible to find the limiting distribution as n —> co 
of a statistic closely related to X assuming only that the population has a 
finite variance o?. The statistic we are referring to here is the standardized 
mean 

iad 
a/,/n 
namely, the difference between x and yz divided by the standard deviation 
of the sampling distribution of x. With reference to this statistic we have 
the following form of the central limit theorem: 
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THEOREM 6.2. If x is the mean of a random sample of size n taken 
from a population having the mean ys and the finite variance 0°, then 


_xX—H 





Z 


ol. Jn 


is the value of a random variable whose distribution function ap- 
proaches that of the standard normal distribution as n — oo, 





We shall not be able to prove this theorem in this text, but at least 
partial (experimental) verification may be obtained by plotting the cumu- 
lative percentage distribution corresponding to the distribution of Figure 
6.1 on arithmetic probability paper. As can be seen from Figure 6.2, the 





Figure 6.2. Experimental verification of central limit theorem. 
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points fall close to a straight line, and it seems that even for n = 10 the 
sampling distribution of x for this example follows the overall pattern of a 
normal distribution. In practice, the normal distribution provides an ex- 
cellent approximation to the sampling distribution of x for n as small as 
25 or 30, with hardly any restrictions on the shape of the population. As we 
_ Saw in our example, the sampling distribution of x had the general shape 
of a normal distribution even for samples of size 10 from a discrete uni- 
form distribution, and this is true in general provided the shape of the 
population distribution is not too skewed. In fact, it can be shown that 
the sampling distribution of x is exactly normal (regardless of the size of 
the sample), when the sample is obtained from a normal population. 


Example. To illustrate the actual use of Theorem 6.2, suppose it is 
known that a one-gallon can of a certain kind of paint will cover on 
the average 513.3 square feet with a standard deviation of 31.5 square 
feet. Suppose, furthermore, we want to know the probability that the 
mean area covered by a sample of 40 of these one-gallon cans will be 
anywhere from 510.0 to 520.0 square feet. By Theorem 6.2 we shall 
have to find the normal curve area between 


510.0 — 513.3 520.0 — 513.3 
31.5/,/40 Se Se Bag, Fe 


and checking these values in Table 3 we obtain a probability of 
0.6553. Note that if x turned out to be much less than 513.3, say, less 
than 500.0, this might cause serious doubt whether the sample actually 
came from a population having w = 513.3 and o = 31.5; the prob- 
ability of obtaining such a small value (a z-value less than 2.67) is only 
0.0038. 


EXERCISES 


A An inspector examines every 20th piece coming off an assembly line. List 
some of the conditions under which this method might not yield a random 


Sample. 

4 In 1932 the Literary Digest predicted the presidential election by random 
sampling from telephone directories and from its list of subscribers. The 
prediction was grossly incorrect; explain why. 
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Ge 
nn market research organization wants to try a new product in 8 of the 50 
states. Use random numbers (Table 7) to make this selection. 


4. Take 30 slips of paper and label five each —4 and 4, four each —3 and 3, 

three each —2 and 2, and two each —3}, 0, and 1. 

(a) If each slip of paper has the same probability of being drawn, find the 
probability of getting —4, —3, —2, —1, 0, 1, 2, 3, 4, and find the mean 
and the variance of this distribution. 

(b) Draw 50 samples of size 10 from this population, each sample being 
drawn without replacement, and calculate their means. 

(c) Calculate the mean and the variance of the 50 means obtained in part 
(b). 

(d) Compare the results obtained in part (c) with the corresponding values 
expected according to Theorem 6.1. [Note that 4 and o? were obtained 

vA in part (a).] 


Repeat Exercise 4, but select each sample with replacement; that is, replace 
each slip of paper and reshuffle before the next one is drawn. 


2 | 6. Given the infinite population whose distribution is given by 


x f(x) 
1 0.25 
2 0.25 
3 0.25 
4 0.25 


list the 16 possible samples of size 2 and use this list to construct the distri- 
bution of x for random samples of size 2 from the given population. Verify 
that the mean and the variance of this sampling distribution are identical 
with the corresponding values expected according to Theorem 6.1. 


f (( 7. When we sample from an infinite population, what happens to the standard 
. error of the mean if 


(a) the sample size is increased from 80 to 720; So *% R260 
(b) the sample size is increased from 100 to 225; “MOC tF GOO 
(c) the sample size is decreased from 750 to 30? ao ce aS 
64S tR ¥0 
8. What is the value of the finite population correction factor in the formula for 
Oo; when 


(a) n = 5 and N = 200; 
(bY n = 10 and N = 400; 
) n = 100 and N = 5,000? 


. Show that there is a fifty-fifty chance that the mean of a random sample of 
size n from an infinite population with the standard deviation @ will deviate 
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from i by less than 0.6745 rom, It has been the custom to refer to this 


quantity as the probable error of the mean. 


b i A 010. If x is a value of a continuous random variable and y = ax + b, show that 


(a) Mx = aut, + b; 
(b) 6? =a@’o?. 


11. Prove that “4; = mu for random samples from discrete (finite or countably 
infinite) populations. | 


L / .. 12. A random sample of size 36 is taken from an infinite population having the 


mean ££ = 63 and the variance 0? = 81. What can we assert about the prob- 
ability of getting a sample mean greater than 66.75 using 

(a) Chebyshev’s theorem; 

(b) the central limit theorem given on page 169? 


13. A random sample of size 100 is taken from an infinite population having the 
mean ££ = 76 and the variance 0? = 256. What is the probability of getting 
an x between 75 and 78? 


14. If measurements of the specific gravity of a metal can be looked upon as a 
sample from a normal population having a standard deviation of 0.04, what 
is the probability that the mean of a random sample of size 25 will be “off” 
by at most 0.02? 


15. A wire-bonding process is said to be in control if the mean pull strength is 
10 pounds. It is known that the pull-strength measurements are normally 
distributed with a standard deviation of 1.5 pounds. Periodic random sam- 
ples of size 4 are taken from this process and the process is said to be “out of 
control” if a sample mean is less than 7.75 pounds. Comment. 


(416. If the distribution of the weights of all men traveling by air between Dallas 


and El Paso has a mean of 163 pounds and a standard deviation of 18 
pounds, what is the probability that the combined gross weight of 36 men 
traveling on a plane between these two cities is more than 6,000 pounds? 


fp 


oy THE SAMPLING DISTRIBUTION OF THE 
| MEAN (0 UNKNOWN) 


Application of the theory of the preceding section requires knowledge 
of the population standard deviation o. If 7 is large, this does not pose any 
problems as it is reasonable to use the theory also when o is unknown, 
substituting for it the sample standard deviation s. So far as the statistic 


x — U 
S]/ n 
values of m unless we make the assumption that the sample comes from a 
normal population. Under this assumption, it is possible to prove the fol- 





is concerned, little is known about its exact distribution for small 
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lowing theorem: 


THEOREM 6.3. Jf x is the mean of a random sample of size n taken 
from anormal population having the mean yu and the variance oa”, 
then 


t= ~—# 


sh /n 


is the value of a random variable having the Student-t distribution* 
with the parameter y = n — 1, 





This theorem is more general than Theorem 6.2 in the sense that it does not 
require knowledge of a; on the other hand, it is /ess general than Theorem 
6.2 in the sense that it requires the assumption of a normal population. 

As can be seen from Figure 6.3, the overall shape of a ¢ distribution is 
similar to that of a normal distribution—both are bell-shaped and sym- 
metrical about the mean. Like the standard normal distribution, the f¢ dis- 
tribution has the mean 0, but its variance depends on the parameter v (nu), 
called the number of degrees of freedom. The variance of the ¢ distribution 
exceeds 1, but it approaches | as nm — oo. In fact, it can be shown that the 
t distribution with y degrees of freedom approaches the standard normal 
distribution as vy — ©, 





Figure 6.3. Student-t and standard normal distributions. 


*The Student-r distribution was first investigated in 1908 by W. S. Gosset, who 
published his findings under the pen name of “Student” because his employers did not 
permit the publication of research done by their staff. 
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Table 4 at the end of the book contains selected values of ¢, for various 
values of v, where ¢, is such that the area under the ¢ distribution to its right 
is equal to a. In this table the left-hand column contains values of v, the 
column headings are areas «@ in the right-hand tail of the ¢ distribution, and 
the entries are values of t, (see also Figure 6.4). It is not necessary to tabu- 
late values of t, for a > 0.50, as it follows from the symmetry of the ¢ 
distribution that ¢,_, = —t,; thus, the value of ¢ that corresponds to a 


left-hand tail area of w is —f,. 





Figure 6.4. Tabulated values of f. 


Note that in the bottom row of Table 4 the entries correspond to the 
values of z which cut off right-hand tails of area «% under the standard 
normal curve. Using the notation z, for such a value of z, it can be seen, 
for example, that z.5.; = 1.96 = t.9.; for vy = oo. This result should really 
have been expected since the ¢ distribution approaches the standard normal 
distribution as y — oo. In fact, observing that the values of t, for 29 or 
more degrees of freedom are close to the corresponding values of z,, we 
conclude that the standard normal distribution provides a good approxima- 
tion to the t distribution for samples of size 30 or more. 


Example. To illustrate the use of the ¢ distribution, suppose that a 
manufacturer of fuses claims that with a 20 percent overload, his fuses 
will blow in 12.40 minutes on the average. To test this claim, a sample 
of 20 of the fuses was subjected to a 20 percent overload, and the times 
it took them to blow had a mean of 10.63 minutes and a standard 
deviation of 2.48 minutes. On the assumption that these times may be 
looked upon as a random sample from a normal population with 
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hu = 12.40, the statistic 


10.36 — 12.40 
pa ee Ss 9,19 
2.48/,/20 


is a value of a random variable having the ¢ distribution with y = 
20 — 1 = 19 degrees of freedom. Now, from Table 4 we find that for 

= 19 the probability that ¢ will exceed 2.861 is 0.005, and hence we 
conclude that the probability that ¢ will be less than —2.861 is also 
0.005. Since the value we obtained for our example is t = —3.19, we 
are faced with the following two alternatives: either it is true that 
jt = 12.40 minutes and we have observed a relatively rare event, or 
ut 12.40 minutes (in fact, it is less than 12.40). If we actually had to 
make this choice, we might well be inclined to accept the second alter- 
native, namely, to reject the manufacturer’s claim. 


The assumption that the sample must come from a normal population 
is not so severe a restriction as it may first seem. Studies have shown that 


the distribution of the statistic t = aie is fairly close to a ¢ distribution 
even for samples from certain nonnormal populations. In practice, it is 
necessary to make sure primarily that the population from which we are 
sampling is approximately bell-shaped and not too skewed. A practical 
way of checking this assumption is to plot the observations on arithmetic 
probability paper as described on page 143. (If such a plot shows a dis- 
tinct curve rather than a straight line, it may be possible to “straighten it 
out” by transforming the data—say, by taking their logarithms or their 
square roots.) 





.4 THE SAMPLING DISTRIBUTION 
OF THE VARIANCE 


So far we have discussed only sampling distributions of the mean, but 
if we had taken the medians or the standard deviations of the 50 samples 
mentioned in the example on page 164, we would similarly have obtained 
experimental sampling distributions of these statistics. In this section we 
shall be concerned with the theoretical sampling distribution of s? for ran- 
dom samples from normal populations. Since s? cannot be negative, we 
should expect that this sampling distribution is not a normal curve; in fact, 
it is related to a gamma distribution with the parameters « = v/2 and 
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B = 2 (see page 117), called the chi-square distribution. Specifically, we 
have 


THEOREM 6.4. If s* is the variance of a random sample of size n 
taken from anormal population having the variance o*, then 


2 _ (n— 1)s* 


X 


o2 


is a value of a random variable having the chi-square distribution 
with the parameter y =n — 1. 





Table 5 at the end of the book contains selected values of v2 for vari- 
ous values of y, again called the number of degrees of freedom, where 7? 
is such that the area under the chi-square distribution to its right is equal 
to a. In this table the left-hand column contains values of vy, the column 
headings are areas « in the right-hand tail of the chi-square distribution, 
and the entries are values of v2 (see also Figure 6.5.) Unlike the ¢ distribu- 
tion, it is necessary to tabulate values of v2 for « > 0.50, because the chi- 
square distribution is not symmetrical. 





Figure 6.5. Tabulated values of chi-square. 


Example. To illustrate the use of Theorem 6.4, suppose an optical 
firm purchases glass to be ground into lenses, and that past experience 
has shown that the variance of the refractive index of this kind of glass 
is 1.26-10-4. To grind the glass into lenses having a given focal length, 
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it is important that the various pieces of glass have nearly the same 
index of refraction; hence, let us suppose that such a shipment of glass 
is to be rejected if the sample variance of 20 pieces selected at random 
exceeds 2.00-10-4. Assuming that the sample values may be looked 
upon as coming from a normal population with 0? = 1.26-10~4, the 
probability that a shipment will be erroneously rejected may be com- 
puted as follows. First we obtain 


, _ 19(2.00-10-4) _ 
x" = 196-1974 30.2 


and then we find from Table 5 that for 19 degrees of freedom %3,, = 
30.1. Thus, the probability that a good shipment will erroneously be 
rejected by this criterion is less than 0.05. 


A problem closely related to that of finding the distribution of the 
sample variance is that of finding the distribution of the ratio of the vari- 
ances of two independent random samples. This problem is important 
because it arises in tests in which we want to determine whether two sam- 
ples come from populations having equal variances. If they do, the two 
sample variances should be nearly the same; that is, their ratio should be 
close to 1. To determine whether the ratio of two sample variances is too 
small or too large, we use the theory given in the following theorem: 





THEOREM 6.5. If s{ and s% are the variances of independent random 
samples of size n, and n,, respectively, taken from two normal popu- 
lations having the same variance, then 


-=b 


Px 


S 
3 


is a value of a random variable having the F distribution with the 
parameters v,; =n, — l andy, =n, — 1. 





The F distribution is related to the beta distribution (page 120), and it 
has the two parameters v,, the degrees of freedom for the sample variance in 
the numerator, and v,, the degrees of freedom for the sample variance in the 
denominator; when referring to a particular F distribution, we always 
give first the degrees of freedom for the numerator. As it would require too 
large a table to give values of F, corresponding to many different right- 
hand tail areas «, and since « = 0.05 and « = 0.01 are most commonly 
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Figure 6.6. Tabulated values of F. 


used, Table 6 contains only values of F.); and F.9,; for various combina- 

tions of values of vy, and v, (see also Figure 6.6). | 
It is possible to use Table 6 also to find values of F corresponding to 

left-hand tails of area 0.05 and 0.01. Writing F,(v,, v,) for F, with vy, and 


vy, degrees of freedom, we simply use the identity 


1 
F mre Vv 5 Vv = 
( 2) F (V2; V1) 
Example. For instance, to find F, for 10 and 20 degrees of freedom and 
a = 0.95, we have only to write 
al, 


F..(20,10) 2.77 





F.,;(10, 20) = = 0.36 


Note that Theorem 6.4 as well as Theorem 6.5 requires the assumption 
that we are sampling from normal populations. As in the case of the ¢ 
distribution, this assumption can be loosened somewhat in actual practice 
without materially altering the respective sampling distributions. Again, 
the use of arithmetic probability paper is suggested to investigate whether 
it is reasonable to treat the samples as coming from normal populations. 


di , EXERCISES 


A random sample/6f size 25 from a normal population has the mean x = 
47.5 and the standard deviation s — 8.4. Basing the decision on the ¢ statis- 
tic, is it reasonable to say that this information supports the claim that the 
mean of the population is 42.1? 


> 


D> 






: are Sampling Distribution of the Variance 

,u it | 

2. The following are the times between six calls for an ambulance (in a certain 
city) and the patient’s arrival in the hospital: 27, 15, 20, 32, 18, and 26 
minutes. Use the f¢ statistic to judge the reasonableness of the ambulance 
service’s claim that it takes on the average 20 minutes between the call for an 

lance and the patient’s arrival at the hospital. 


3. A process for making certain bearings is under control if the diameters of 
the bearings have a mean of 0.5000 cm. What can we say about this process if 
a sample of ten of these bearings has a mean diameter of 0.5060 cm and a 
standard deviation of 0.0040 cm? 


AS \ Dhe claim that the variance of a normal population is @2 = 21.3 is rejected 
if the variance of a random sample of size 15 exceeds 39.74. What is the prob- 
ability that the claim will be rejected even though a2 = 21.3? 
a 


5, random sample of 10 observations is taken from a normal population 
having the variance 0% = 42.5, Find the approximate probability of obtain- 
haces standard deviation between 3.14 and 8.94. 
6/If 


two independent random samples of size ny = 9 and n, = 16 are taken 
from a normal population, what is the probability that the variance of the 
first sample will be at least 4 times as large as the variance of the second 
sample? 


7. With reference to Exercise 6, find the probability that either sample variance 
will be 4 times as large as the other. 


By he chi-square distribution with 4 degrees of freedom is given by 
C) 


(,*? _ fdrxce*/2? x > 0 
iGy= é x<0 


Find the probability that the variance of a random sample of size 5 from a 
normal population with o = 12 exceeds 180. 


9. The ¢ distribution with 1 degree of freedom is given by 


f= =(1 git asee Sp es 


| Verify” the corresponding value of ¢t); in Table 4. 
, ae, a 


(10yT he F distribution with 4 and 4 degrees of freedom is given by 
—~ 4a es 
e +7 6F1+F)+ F>0 
fF) =| 
0 F<0 


If random samples of size 5 are taken from two normal populations having 
the same variance, find the probability that the ratio of the larger to the 
smaller sample variance exceeds 3. 





Inferenees 
Concerning 
Means 


7.1 POINT ESTIMATION 


On page 163 we stated that the purpose of most statistical investigations 
is to generalize from information contained in random samples about the 
populations from which the samples were obtained, namely, to make 
a statistical inference. In the classical approach to statistical inference, 
these methods are divided into the major areas of estimation and tests 
of hypotheses, with estimation subdivided further into point estimation 
and interval estimation. More recently, the methods of statistical inference 
have been unified under the general concepts of decision theory, that is, 
under the general concepts of decision making in the face of uncertainty. 
In this section we shall discuss the general problem of point estimation, 
applying it to the estimation of a population mean, y, with the mean of 
a random sample. 


180 
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Basically, point estimation concerns the choosing of a statistic, a single 
number calculated from sample data (and perhaps other information), 
for which we have some expectation, or assurance, that it is “reasonably 
close” to the parameter it is supposed to estimate. To explain what we 
mean here by “reasonably close” is not an easy task; first, the value of 
the parameter is unknown, and second, the value of the statistic is un- 
known until after the sample has been obtained. Thus, we can only ask 
whether, upon repeated sampling, the distribution of the statistic has 
certain desirable properties akin to “closeness.” For instance, we know 
from Theorem 6.1 that the sampling distribution of x has the same mean 
as the population from which the sample is obtained; hence, we can 
expect that the means of repeated random samples from a given popula- 
tion will center on the mean of this population and not about some other 
value. To formulate this property more generally, let us now make the 
following definition: 


a 


A statistic 8 is said to be an unbiased estimate of the parameter @ if 
and only if the mean of its sampling distribution is equal to 0. 





Thus, we call a statistic unbiased if “on the average” its values can be 
expected to equal the parameter it is supposed to estimate. 

Generally speaking, the property of unbiasedness is one of the more 
desirable properties in point estimation, although it is by no means 
essential and it is sometimes outweighed by other factors. One short- 
coming of the criterion of unbiasedness is that it will generally not provide 
a unique statistic for a given problem of estimation. 


Example. For instance, it can be shown that for random samples of 
ax, + bx, 

a+b ’ 
where a and 5b are positive constants, are unbiased estimates of the 
mean of the population. For samples of any size, so are the sample 
median and the midrange (the mean of the largest value and the 
smallest) if we assume, furthermore, that the population is symmetric. 


size 2 the mean m1 a as well as the weighted mean 


This suggests that we must seek a further criterion for deciding which 
of several unbiased estimates is “best” for estimating a given parameter. 
Such a criterion becomes evident when we compare the sampling distri- 
butions of the mean and the median for random samples of size n taken 
from the same normal population. Although these two distributions have 
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the same mean, namely, the population mean yw, and although they are 
both symmetrical and bell-shaped, their variances differ. In Theorem 6.1 
we proved that for infinite populations the variance of the sampling dis- 
tribution of the mean is o2/n, and it can be shown for the corresponding 
sampling distribution of the median that its variance is approximately 
1.57-¢2/n. Thus, it is more likely for any given sample that the mean will 
be closer to u# than the median. Note that this does not imply that the 
sample mean is necessarily always closer than the median; in fact, in any 
given problem we have no way of knowing which of the two is actually 
closer. 

This important property, in which we compare the variances of the 
sampling distributions of statistics, is formalized by means of the following 
definition: 





A statistic 0 _ is said to be a more efficient unbiased estimate of the 
parameter 6 than the statistic @,, if: 
i 6, and 6, are both unbiased estimates of 0. 


2. The variance of the sampling distribution of 6, is less than the 
variance of the sampling distribution of @3. 





We have, thus, seen that for random samples from normal populations the 
mean is more efficient than the median as an estimate of w; in fact, it can 
be shown that in most situations met in actual practice the variance of the 
sampling distribution of no other statistic is less than that of the sampling 
distribution of the mean. In other words, in most practical situations the 
sample mean is an acceptable statistic for estimating a population mean y. 
(There exist several other criteria for assessing the “goodness” of methods 
of point estimation, but we shall not discuss them in this book.) 

When we use a sample mean to estimate the mean of a population, 
we know that although we are using a method of estimation which has 
certain desirable properties, the chances are slim, virtually nonexistent, 
that the estimate is exactly equal to uw. Hence, it would seem desirable to 
accompany such a point estimate of uw with some statement as to how 
close we might reasonably expect the estimate to be. The error, x — HW, 
is the difference between the estimate and the quantity it is supposed to 
estimate. To examine this error, let us make use of the fact that for large n 


tau 
ao|,/ n 
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is a value of a random variable having approximately the standard 
normal distribution. Consequently, we can assert with a probability of 
1 — o that 





Z 4/2 = ee ao, 
aye 
or that 
a = JE} 
——_—— < Z, 
o|,/ n /2 


where Z,,. is such that the normal curve area to its right equals «/2. If we 
now let E stand for |x — uw|, the magnitude of the error of estimate, we 
have 


Es, 25 : 





arn 


with a probability of 1 — a. In other words, if we estimate uw by means of 
a random sample of size n, we can assert with a probability of 1 — « that 
the error, |x — |, is less than z,,.-0/,/n, at least for sufficiently large 
values of n. 


Example. To illustrate, suppose that the mean of the sulfur oxides 
emission data on page 137, namely, x = 18.85 tons, is to be used to 
estimate yz, the plant’s actual average daily emission of these pollutants. 
To see what we can assert with a probability of 0.95 about the possible 
size of the error of this estimate, let us approximate o with the value 
obtained for s on page 155, namely, ,/30.77 = 5.55. This is justifiable 
since the sample was large. Thus, substituting s = 5.55, n = 80, and 
Z,/2 = 1.96 into the formula for F, we find that we can assert with a 
probability of 0.95 that the error of the estimate is less than 


1.96 - DO 1.22 tons 


/80 


Of course, the error of the estimate is less than 1.22 or it is not, and 
we really don’t know which happens to be right, but if we had to bet, 
then 95 to 5 (or 19 to 1) would be fair odds that the error is less than 
1.22 tons. | 
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If we take the above inequality for E and solve it for n, we obtain 


Z4/2°O 7 
n< ee | 


and it follows that if we select the sample size n so that 


-f44] 
E 


we can assert with a probability of 1 — o that the error of estimating yu 
by means of x will be less than E. To be able to use this formula for cal- 
culating the sample size needed to estimate mw in a given situation, it is 
necessary that we specify a, o, and EZ. Thus, we must give not only the 
maximum tolerable error E and the population standard deviation a, 
but also the probability 1 — « with which we want to assert that the 
maximum error will be less than E. The population standard deviation is 
usually estimated with prior data of a similar kind, and sometimes a good 
guess will have to do. 





Example. An efficiency expert has to determine the average time it 
takes a mechanic to rotate the tires of a car, and he wants to be able 
to assert with a probability of 0.99 that the error of his estimate, 
the mean of an appropriate sample, will be off by less than half a 
minute. If it is known from past experience that it is reasonable to 
let o = 1.6 minutes, substitution into the formula yields 


72.58-1.67 
n= |S" — 68.2 


or 69 rounded up to the nearest integer. Thus, the efficiency expert will 
have to take a sample of size 69 (that is, time 69 mechanics while they 
are rotating the tires of a car). 


The methods we have discussed require that o be known or that it 
can be approximated with the sample standard deviation s, thus requiring 
that 7 be large. However, if it is reasonable to assume that we are sampling 
from a normal population, we can base our argument on Theorem 6.3 
instead of Theorem 6.2, namely, on the fact that> 


= 


s],f/ n 
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is a value of a random variable having the Student-t distribution with 
n— 1 degrees of freedom. Duplicating the steps on page 183, we thus 
arrive at the result that we can assert with a probability of 1 — o that 


E < la /2 . a 


r/ n 





where Fis the error we make in using x to estimate yw, and f,,, is as defined 
on page 174. 


Example. In 6 determinations of the melting point of tin, a chemist 
obtained a mean of 232.26 degrees centigrade with a standard devia- 
tion of 0.14 degrees. If he uses this mean as the melting point of tin, 
let us see what we can say with a probability of 0.98 about the possible 
size of his error. Substituting n = 6, s = 0.14 and 3.365 (the value of 
to.o, for 5 degrees of freedom) into the inequality for E, we get 


0.14 


E < 3.365 + = 0.19 


Thus, we can assert with a probability of about 0.98 that he is off 
by at most 0.19 degree. 


7.2 INTERVAL ESTIMATION 


Since point estimates cannot really be expected to coincide with the 
quantities they are intended to estimate, it is sometimes preferable to 
replace them with interval estimates, that is, intervals for which we can 
assert with a reasonable degree of certainty that they contain the pa- 
rameter under consideration. To illustrate the construction of such an 
interval, suppose that we have a random sample of size n, where 7 is large, 
from a population having the unknown mean yw and the known variance 
o”. Referring to the double inequality on page 183, namely, 


x — Hu 
a/./ n 


which we asserted with a probability of 1 — a, we can now apply simple 


me hy. 





< 2/2 
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algebra and rewrite it as 


=e 


X — Zein § am <M XH Za? 
/2 we Lt /2 


ase 
/ 1 





Thus, we can claim with a probability of 1 — « that the interval from 
oe} 
ee 
an interval of this kind as a confidence interval for u having the degree 

of confidence 1 — &. 








X — Zaj2° to X + Zy/2 ° a contains y. It is customary to refer to 


Example. At this point it is well to reconsider just what is meant 
by the statement “We can claim with a probability of 1 — @ that such 
an interval contains yu.” If a random sample of size n = 100 is taken 
from a population having o = 5.1 and we obtain a sample mean of 
21.6, then a 0.95 confidence interval for uw is given by 


5.1 

21.6 + 196-99? 

namely, the interval from 20.6 to 22.6. Since the mean of the given popu- 
lation either is contained or is not contained in this interval, it would 
hardly seem reasonable to speak of the probability of such an event. 
Indeed, what we really mean when we claim that the interval from 
20.6 to 22.6 is a 0.95 confidence interval for the mean of the popu- 
lation is that in repeated sampling 95 percent of the confidence intervals 
obtained with the above formula contain the means of the respective 
populations. Thus, although we shall never know whether the popula- 
tion mean is really contained in the interval from 20.6 to 22.6 in the 
given example, we do have the assurance that the method used to obtain 
the interval is 95 percent reliable, that is, it can be expected to work 
95 percent of the time. 


The formula obtained for a 1 — @ confidence interval for w has the 
unfortunate feature that it requires knowledge of the population standard 
deviation. It is “unfortunate” because in most practical problems a is 
unknown; therefore, we generally have little recourse but to replace o 
with an estimate in the hope that, at least for large samples, the resulting 
confidence interval will be a close approximation. Substituting for o the 
sample standard deviation s, which has desirable properties as a point 
estimate of o (see Chapter 8), we use the interval 
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KZ aie poke z 
a/2 ° Jn al, 
n a/ nh 


as an approximate /arge-sample confidence interval for u having the degree 
of confidence 1 — a. 


Example. To illustrate the use of this last formula, let us refer again 
to the sulfur oxides emission data on page 137, for which n = 80, 
x = 18.85 tons, and s = 5.55 tons. Substituting all these values 
together with z,,. = 1.96 into the formula, we find that a 0.95 con- 
fidence interval for yu, the plant’s actual average daily emission of 


sulfur oxides, is given by 18.85 + 1.96 - wi or 18.85 + 1.22, which 





can be written as 
17.63 < uw < 20.07 


Observe the close relationship between the width of the confidence interval 
obtained here and the maximum error asserted with a probability of 0.95 
on page 183. This kind of relationship will always exist so long as the 
sampling distribution of the statistic on which the methods are based is 
symmetrical. 

When the sample size n is not large enough to approximate o with s, 
we proceed as on page 185 provided it is reasonable to assume that we are 
sampling from a normal population. Thus, with t,,. defined as on page 174, 
we obtain the 1 — « confidence interval 


Eta om SSE tesa 


It applies to samples from normal populations, but in accordance with 
the discussion on page 175, it may be used so long as the sample does not 
exhibit any pronounced departures from normality. 


Example. If the mean weight loss of » = 16 grinding balls after a 
certain length of time in mill slurry is 3.42 grams with a standard 
deviation of 0.68 grams, a 0.99 confidence interval for yu is given by 


5.42 + 2.947 « iE - in other words, 2.92 < ¢ < 3.92. 
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7.3. BAYESIAN ESTIMATION 


In recent years, there has been mounting interest in methods of infer- 
ence which look upon parameters as random variables. This idea is not 
really new, but these Bayesian methods, as they are called, have received 
considerable impetus and much wider applicability through the concept 
of personal, or subjective, probability. In fact, this is why supporters of 
the personal concept of probability refer to themselves as Bayesians, or 
Bayesian statisticians. 

In this section we shall present a Bayesian method of estimating the 
mean yu of a population. Proponents of the subjective, or personal, point 
of view in probability look upon yw as a random variable whose distribu- 
tion is indicative of how strongly a person feels about the various values 
which wu can take on. In other words, they suggest that in any problem 
where we want to estimate the mean of a population, a person will feel 
most strongly about some particular value of uw, and that this enthusiasm 
will diminish for values of w which are further and further away from the 
one he likes the most. Like any distribution we use in actual practice, this 
kind of subjective prior distribution for the possible values of ~ has a mean 
which we shall denote 4, and a standard deviation which we shall denote 
CO. 


Example. To illustrate the concept of a prior distribution of the mean, 
let us refer again to the air pollution example on page 137, and let 
us suppose that before the sample data were actually obtained the chief 
engineer of the plant said that his feelings about the true average daily 
emission of sulfur oxides can best be described by a normal distribu- 
tion with the mean “Z, = 17.50 tons and the standard deviation go, = 
2.50 tons. Given this information, we could ask, for example, how he 
felt about the possibility that the plant’s true average daily emission 
of these pollutants is somewhere between 18.00 and 19.00 tons. The 
answer to this question is given by the area of the shaded region of 
the first diagram of Figure 7.1, and, as can easily be verified (see 
Exercise 20 on page 194), he would have assigned a probability of about 
0.15 to the mean actually lying between 18.00 and 19.00 tons. Note that 
the interval from 18.00 tons to 19.00 tons is not a confidence interval 
which the chief engineer could have asserted with a degree of con- 
fidence of 0.15. No, based on prior considerations, he would be making 
a direct probability statement about the mean of the population, namely, 
about the plant’s true average daily emission of sulfur oxides. 
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Figure 7.1. Prior and posterior distribution of the mean. 


In Bayesian estimation, prior information about the possible values 
of the population mean, as expressed in terms of a prior distribution, is 
combined with direct sample evidence consisting of a random sample of 
size n, its mean x, and its standard deviation s (which may have to serve 
as an estimate of o, the standard deviation of the population). Treating 
the sample mean as well as the population mean as random variables and 
using the notation introduced on page 129, this can be accomplished by 
referring to the conditional distribution of the population mean given 
x, whose values are written g,(u |x). Using the result of Exercise 10 on 
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page 131, g,(u|x) can be expressed in terms of f,(w), f,(x), and g(x | 1), 
where f;() is a value of the prior distribution of the mean of population 
and g,(x|) is a value of the sampling distribution of the mean for a 
given population mean w. If, as in our example, the prior distribution of 
the population mean is a normal distribution with the parameters jy 
and g,, and n is large enough to treat the sampling distribution of the 
mean as a normal distribution with the parameters uw and a/,/7n, it can 
be shown that the posterior distribution of the population mean (namely, 
the conditional distribution of the population mean given x) is also anormal 
distribution and that its mean and standard deviation are 


or 2 
NXO>o + Lod 
Hy, = — > *S = and o,= 
n0o + @ nai matte oO” 





Derivations of these results may be found in several of the textbooks on 
theoretical statistics listed in the Bibliography at the end of this book. 


Continuation of Example. Let us now combine the prior information 
given on page 188, that based on the feelings of the chief engineer, 
with the fact that a random sample of size n = 80 yielded a mean of 
x = 18.85 tons and a standard deviation of s = 5.55 tons. Using s 
as an estimate of o and substituting these values together with uw, = 
17.50 and o) = 2.5 into the above formulas for uw, and a,, we obtain 


_ 80(18.85)(2.5)? + 17.50(5.55)? __ 1g 47 
1~ "802.57 +655". 


and 


— 65.55"(2.5)2 


80(2.5)? + 6.552 vim 


oO; = 


Note that the mean of the posterior distribution, namely, uz, = 18.77, 
provides us with a point estimate of the mean of the population. It is 
based on direct evidence as well as prior information; in fact, it isa 
weighted mean (see page 158 and also Exercise 17 on page 193) of x 
and Ly. 

To continue with this example, let us now find the posterior prob- 
ability that the population mean lies on the interval from 18.00 to 19.00 
tons. This time the probability is given by the area of the shaded 
region of the second diagram of Figure 7.1, and, as the reader will be 
asked to verify in Exercise 20 on page 194, the answer is 0.55. Note 
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that knowledge about the sample with x = 18.85 increased the prob- 
ability associated with the interval from 18.00 to 19.00 tons from 
0.15 to 0.55. 


EXERCISES 


To illustrate that the mean of a random sample is an unbiased estimate of the 

mean of the population, consider five slips of paper numbered, respectively, 

3, 6, 9, 15, and 27. 

(a) List all possible samples of size 3 that can be taken without replacement 
from this finite population. 

(b) Calculate the mean of each of the samples listed in (a), and, assigning 
each sample a probability of ,4, verify that the mean of these x’s equals 


12, namely, the mean of the population. 

Fo verity the claim that the mean is generally more efficient than the median 
(namely, that it is subject to smaller chance fluctuations), a student conducted 
an experiment consisting of 12 tosses of three dice. The following are his 
results: 2, 4, and 6; 5, 3, and 5; 4, 5, and 3; 5, 2, and 3; 6, 1, and 5; 3, 2, and 
1; 3,1, and 4; 5, 5, and 2; 3, 3, and 4; 1, 6, and 2; 3, 3, and 3; 4, 5, and 3. 
(a) Calculate the twelve medians and the twelve means. 

(b) Group the medians and the means obtained in (a) into separate distribu- 
tions having the classes 1.5—2.5, 2.5—3.5, 3.5-4.5, and 4.5—S.5. 

(c) Draw histograms of the two distributions obtained in (b) and explain 
how they illustrate the claim that the mean is generally more efficient 


han the median. | 
(36 {While performing a certain task under simulated weightlessness, the pulse 


£’ .rate of 32 astronaut trainees increased on the average by 26.4 beats per 
ae? “minute with a standard deviation of 4.28 beats per minute. What can one 
assert with a probability of 0.95 about the possible size of the error, if this 
sample mean is used to estimate the true average increase in the pulse rate of 
astronaut trainees performing the given task ? 


se the data of Exercise 3 to construct a 0.95 confidence interval for the true 
4“ avefage increase in the pulse rate of astronaut trainees performing the given 
t 
o €stimate the average time it takes to assemble a certain computer com- 
ponent, the efficiency expert of an electronics firm timed 40 technicians in 
the performance of this task, getting a mean of 12.73 minutes and a standard 
deviation of 2.06 minutes. 
(a) What can we say with a probability of 0.99 about the possible size of the 
error if X = 12.73 minutes is used as an estimate of the actual average 
time required to do the job? 
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(b) Use the data to construct a 0.98 confidence interval for the true average 
time it takes to assemble the computer component. 


6. Referring to Exercise 5, with what probability can we assert that the sample 
mean is within 30 seconds of the true mean? 


7. Find the largest error one can expect to make with a probability of 0.90 when 
using the mean of a random sample of size n = 64 to estimate the mean of a 
population having a variance of 2.56. 


Ye \ 


/%./ In a study of automobile collision insurance costs, a random sample of 80 
¥ body repair costs on a particular kind of damage had a mean of $472.36 and 
~ a standard deviation of $62.35. What is the probability that an error of no 
re than $10 is made when estimating the true average repair cost of this 
ind of damage as $472.36? 


9. If we wanted to determine the average mechanical aptitude of a large group 
ae workers, how large a random sample would we need to be able to assert 
with a probability of 0.95 that the sample mean will be within 3 points of the 
true mean? Assume that it is known from past experience with similar data 
that 0 = 20. 


/10.) ) Referring to Exercise 8 and using $62.35 as an estimate of o, how large a 
\/ " sample would be needed to be able to assert with a probability of 0.99 that 
/the sample mean will be within $10 of the true mean? 










~ In the example on page 174, twenty fuses were subjected to a 20 percent 
overload, and the times it took them to blow had a mean of 10.63 minutes 
and a standard deviation of 2.48 minutes. Based on these data, what can we 
say with a probability of 0.95 about the possible size of our error, if we use 
10.63 minutes as an estimate of the true average time it takes such fuses to 
blow with a 20 percent overload? 


12.Given that 10 bearings made by a certain process have a mean diameter of 
-+ V9:5060 cm and a standard deviation of 0.0040 cm (see Exercise 3 on page 
79), construct a 0.99 confidence interval for the actual average diameter of 
/ bearings made by this process. 







¥3. Inspecting ceramic tiles prior to their shipment, a quality control engineer 
detects 2, 3, 6, 0, 4, and 9 defectives in six cartons, each containing 144 tiles. 
What can we assert with a probability of 0.99 about the possible size of his 
error, if he uses the mean of this sample to estimate the true average number 
of defective tiles per carton? 


14. In an air pollution study, the following amounts of suspended benzene- 
soluble organic matter (in micrograms per cubic meter) were obtained at an 
experiment station for eight different samples of air: 2.2, 1.8, 3.1, 2.0, 2.4, 
2.0, 2.1, and 1.2. Construct a 0.95 confidence interval for the corresponding 
true mean. 


15. In order to test the durability of a new paint, a highway department had test 
“~~ strips painted across heavily traveled roads in 15 different locations. If on 
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the average the test strips disappeared after they had been crossed by 146,692 
cars and the standard deviation is 14,380 cars, construct a 0.99 confidence 
interval for the true average number of cars it will take to wear off this paint. 


16. Referring to Exercise 15 and using 14,380 as an estimate of @, find the sample 
size that would have been necessary to be able to assert with a probability of 
0.95 that the sample mean will not be off by more than 10,000. [Hint: First, 
estimate n, by using z = 1.96, then use fp 925 with wm, — 1 degrees of freedom 
to obtain a second estimate 1; repeat this procedure until the last two values 


Patho obtained are equal.] 
17. Show that the expression for the mean of the posterior distribution on page 
190 can be written as 


1 


na 


o2 " Ho 


-x+—5 a 
= : 7 
a2 a 


ciel as a weighted mean of x and fo, with their respective weights being 
= and — ;. Indicate how these weights depend 

(a) on re sample size n; 

(b) on the variability of the population; 

(c)/ on the variability of the prior distribution. 


13“A sales manager’s feelings about the average monthly demand for one of his 
company’s products may be described by means of a normal distribution 
with J4) = 3,800 units and 0) = 260 units. 

(a) What probability does he, thus, assign to the true average (monthly 
demand) being somewhere on the interval from 3,500 to 4,000 units ? 

(b) If data for nine months show an average demand for 3,702 units with a 
standard deviation of 390 units, how would the original point estimate 
of Lo = 3,800 units be modified in the light of this information ? 

(c) How would the probability of part (a) be modified in the light of the 
information given in part (b)? 


19. A distributor of soft-drink vending machines knows that in a supermarket 
one of his machines will sell on the average 835 drinks per week. Of course, 
this mean will vary somewhat from market to market, and this variation is 
measured by a standard deviation of 12.4. So far as a machine placed in a 


particular market is concerned, the number of drinks sold will vary from . . 


week to week, and this variation is measured by a standard deviation of 43.6. 

(a) If this distributor plans to put one of his soft-drink vending machines 
into a brand new supermarket, what estimate would he use for the num- 
ber of drinks he can expect to sell per week ? 

(b) How would he modify his estimate if during 10 weeks the machine sells 
on the average 592 drinks per week? 

(c) How would he modify his original estimate if during 50 weeks the 
machine sells on the average 917 drinks per week? 
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20. Verify that the probabilities represented by the shaded regions of the two 
diagrams of Figure 7.1 are, respectively, 0.15 and 0.55. 


7.4 TESTS OF HYPOTHESES 


There are many problems in which we are not directly concerned with 
the actual value of a parameter; instead we want to know whether its 
value exceeds a given number, is less than a given number, falls into a 
given interval, and so on. Rather than estimate the value of a parameter, 
we thus want to decide whether a statement (or statements) concerning 
the parameter is true or false; that is, we want to test a hypothesis about 
the parameter. For example, in quality-control work a random sample 
may be taken to determine whether the “process mean” (for a given kind 
of measurement) has remained unchanged or whether it has changed to 
such an extent that the process has gone “out of control” and adjustments 
will have to be made. 


Example. To illustrate the general concepts involved in this kind of 
decision problem, suppose that a paint manufacturer claims that the 
average drying time of his new “fast-drying” paint is 20 minutes, and 
that a government agency wants to test the validity of this claim. 
Suppose, furthermore, that 36 boards painted, respectively, with paint 
from 36 different one-gallon cans of this paint dried on the average in 
20.75 minutes. This figure exceeds the average of 20 minutes claimed 
by the manufacturer, but is it large enough to reject the claim; that 
is, is this sufficient evidence to take appropriate action against the 
paint manufacturer? Js it not possible that the discrepancy is due 
entirely to chance and that the paint is as good as claimed even though 
the sample mean turned out to be high? 

In testing the paint manufacturer’s claim, the government agency 
is faced with the problem of establishing a criterion which will enable 
it to take suitable actions. Surely, if the true average drying time of 
the “fast-drying” paint is 20 minutes or less, there is no cause for any 
kind of regulatory action. On the other hand, how much leeway should 
be allowed to the manufacturer if his product is not quite as good as 
claimed? Should some action be taken if the true mean (drying time) 
fis greater than 20.5 minutes, 21 minutes, 21.5 minutes, and so forth? 
Suppose, for the sake of argument, it is decided that a practical 
threat to construction planning exists if the average drying time 
exceeds 21 minutes. We can then think of the set of all possible values 
of uw (the set of the positive real numbers) as being divided into the 
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Figure 7.2. Decision intervals. 


three regions shown in Figure 7.2. If w lies in the rejection interval, 
some regulatory action should be taken and it would be a serious 
error not to reject the paint manufacturer’s claim; if yw lies in the 
acceptance interval, there is certainly no cause for any action and it 
would be a serious error if, by chance, the agency did reject the 
manufacturer’s claim; if yu lies in the indifference interval, it is difficult 
to argue whether or not regulatory action should be taken and no 
serious error is made in either case. 

If uw could actually be known, the decision-making problem faced 
by the government agency would be resolved by simply referring to 
the criterion of Figure 7.2. Since this is not the case in the given 
problem (or any similar practical situation), the decision will have 
to be based instead on the results of a random sample, usually on the 
value obtained for the sample mean. Thus, it is important to realize 
that x might be less than 20 even though yu exceeds 21, and that x 
might exceed 21 even though yw is less than 20. 


This last argument illustrates the fact that errors are unavoidable when 
decisions are based on the results of random samples and, furthermore, 
that these errors are of two different kinds. The sample data can lead 
to the rejection of the claim even though y lies in the acceptance interval, 
and they can lead to the acceptance of the claim even though y lies in the 
rejection interval. Schematically, the situation may be described by means 
of the following table: 





Lt lies in Lt lies in 
acceptance rejection 
interval interval 
Accept the claim No error Type 
error 
Reject the claim Type I No error 


error 
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Note that if yw lies in the indifference interval, no serious error is committed 
whatever decision is made. 

To judge the merits of any decision criterion, it is essential to know 
the probabilities that it will lead to Type I and Type II errors, namely, 
the probabilities of erroneously rejecting or erroneously accepting a 
hypothesis (assumption or claim). Denoting these probabilities by « and 
f, that is, 

a = P (Type [ error) 


B = P (Type II error) 


it is important not only to determine the values of «% and f for given 
criteria, but to construct criteria for which « and f# take on preassigned 
values. 


Continuation of Example. To illustrate the first of these problems, 
suppose that the government agency proceeds as indicated on page 
194 (namely, uses the paint from 36 different one-gallon cans on 36 
boards) and decides to reject the paint manufacturer’s claim if x, the 
mean drying time in this sample of size n = 36, exceeds 20,75 minutes. 
To simplify the discussion, we shall assume also that the standard 
deviation of such drying times can be expected to equal o = 2.4 
minutes. Now @ is the probability of a false rejection, namely, the 
probability that x > 20.75 when actually w< 20. Thus, a cannot 
be determined unless some value of yw (less than or equal to 20) is 
specified, but it is easy to see that the probability of committing a Type 
I error with the given criterion is greatest when uu = 20. In fact, we 
could think of the acceptance interval as consisting only of the point 
jt = 20, in which case there would be only one possible value of a, 
namely, the probability of getting an xX greater than 20.75 when 
= 20. 
To calculate this value of a, we approximate the sampling distri- 
bution of x with a normal distribution having a mean of 20 and a 
2.4 
af 36 
the standard normal curve to the right of 
= 20.15 —~ 20 
0.4 


standard deviation of = (0.4. Thus, @ is given by the area under 





Eero 


and, as can be seen from Table 3, it equals approximately 0.03. 

So far as Type II errors are concerned, we can argue analogously 
that f is a maximum for uv = 21; that is, for all values of the rejection 
interval (wu > 21) the probability of a Type II error is greatest for 
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ft = 21. Thinking of the rejection interval as consisting only of the 
point w= 21, we can find f by approximating the sampling distribu- 
tion of x with a normal distribution having a mean of 21 and a stan- 





dard deviation of <5 = 0.4. Thus, # is given by the area under the 
standard normal curve to the left of 
_ 20.75 — 21 
zZ= =e = 0.625 


and, as can be seen from Table 3, it equals approximately 0.27. 


Using the method illustrated in this example, we can always calculate 
a and # for testing an hypothesis H, of the form w= yy against an 
alternative hypothesis H, of the form mw = yw, on the basis of a given 
criterion, so long as o is known and n is large enough to use the normal 
distribution to approximate the sampling distribution of the mean. Now 
let us turn to the other kind of problem mentioned on page 196, namely, 
that of constructing criteria for which « and f take on preassigned values. 
Actually, this involves determining the sample size n and also the “dividing 
line” of the criterion, which was 20.75 minutes in our example. If we denote 
this dividing line by the letter C, it can be seen from Figure 7.3 (where we 
arbitrarily let uw, > f) that 


G25 C= fp 


a ee d ies = — 
vA a] = an ZB ald 7 





Figure 7.3. Determination of decision criterion. 
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where z, and z, are such that the normal curve areas to their right equal 
a and f, respectively. Solving the first of these two equations for C, we get 


O 


C= M+ 2Z,° ry 


for the dividing line of the criterion; that is, we reject the hypothesis that 
[k= [ly and accept the alternative hypothesis that w= yw, if the mean 


; oO ; 
of a random sample of size n exceeds 4) + Z, + —=——. To determine n, we 


af th 


eliminate C from the equations for z, and —Z,, getting 


n= O*(Z4 = Zp)” 
(fi, — fo)? 


Continuation of Example. With reference to the drying-time example, 
suppose that the government agency is willing to risk Type I and 
Type II errors with respective probabilities of a = 0.05 and f = 0.10. 
Since Z) 9; = 1.645 and z, ,, = 1.28 according to Table 3, we find 
that the required sample size is 


y= (2-4)7(1.645 + 1.28)? 
- (21 — 20) 


or nm = 50 rounded wp to the nearest integer. Correspondingly, the 
dividing line of the criterion is 
2.4 
C = 20 + 1.645 - —_— = 20.56 
Tee 730 


Thus, the government agency can accomplish its goal by taking a 
random sample of size 50, accepting the paint manufacturer’s claim 
so long as the mean of the sample does not exceed 20.56 minutes, and 
rejecting the claim if the mean of the sample exceeds 20.56 minutes. 


In Section 7.5 we shall demonstrate another method, based on special 
tables, for handling problems of this kind. 

In testing an hypothesis of the form “4 = fo, where y, is a specified 
constant, against an alternative hypothesis of the form uw > uy, we used 
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the test criterion: reject the hypothesis u = My if xX > C, where C is usually 
determined so that the probability of committing a Type I error equals some 
preassigned value a. The set of values of x which, thus, leads to the rejec- 
tion of the hypothesis w = uy (and the acceptance of the alternative) is 
called the critical region; in our example it was the set of all real numbers 
greater than C. If the alternative hypothesis is of the form “ < fo, the 
general procedure is the same, but all inequalities are reversed, and the 
critical region becomes x < K, where K must again be chosen so that the 
probability of committing a Type I error equals some preassigned value 
a. This kind of test would arise, for example, if we wanted to investigate 
the claim that a certain kind of car will average 15 miles per gallon in 
city traffic, and we are concerned mostly about the possibility that the 
actual figure is less. 

So far, the tests we have discussed have been one-tail tests; that is, the 
hypothesis H, that “= My has been rejected for values of x falling into 
one “tail” of its sampling distribution. If we now consider the alternative 
LL ~ [lo, We would want to reject H, for values of x that are appreciably 
less than or greater than yy, and the resulting critical region would be 
of the form x < C,; or xX > C,. This kind of test would arise, for example, 
if we wanted to check whether the mean diameter of a certain kind of 
steel pipe is 3.0 inches, as specified. Clearly, there could be complications 
if the mean diameter is either too large or too small. 

The constants C, and C, for such a two-tail test are usually chosen so 
that the two “tails” are equal (that is, alternative values of uw at the same 
distance on either side of uw, have the same chance of being accepted) 
and the probability of committing a Type I error equals some preassigned 
constant «. As can be seen with the aid of Figure 7.4, the constants C, 
and C, may thus be obtained by solving the equations 


am Z@/2 


ol.jn a|./ 


and the results are 


Ci = fe. and Cr — Ho 
n 











oO 
C; = fo — 2a/2* va and C, = Ho am Zu/2 ° Wri 





Example. To illustrate, suppose that a process for making steel pipe 
is under control if the diameter of the pipe has a mean of 3.0000 
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Figure 7.4. Two-tail test criterion. 


inches with a standard deviation of 0.0250 inch. To check whether 
the process is under control, a random sample of size 30 is taken 
each day and the criterion is such that the probability of committing 
a Type I error is « = 0.05. To find C, and C, we substitute the given 
values together with Z).9,; = 1.96 into the formulas given above, and 
we get 


C= 30000 — 1.565 8" = 9.0911 


r/ 30 


and 
0.0250 
r/ 30 
Thus, the process is regarded to be out of control if x is less than 
2.9911 inches or greater than 3.0089 inches. 

The calculation of , the probability of a Type II error, is the 
same as before, and we shall illustrate it here for the case where 


L, = 3.0050. Referring to the sampling distribution of x with the 


mean “, = 3.0050 and the standard deviation aT = 0.0046 (see 


C, = 3.0000 + 1.96 - = 3.0089 





Figure 7.5), we shall have to determine the area under the curve 
between C, = 2.9911 and C, = 3.0089, and it will be left to the 
reader to verify in Exercise 3 on page 208 that the answer is approxi- 
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mately 0.80. Thus, if the true mean diameter of the steel pipe is 3.0050 
inches, the probability is about 0.80 that the given criterion will 


erroneously accept the hypothesis H, that the mean diameter is 
3.0000 inches. 





Figure 7.5. Determination of probability of Type II error. 


7.5 OPERATING CHARACTERISTIC CURVES 


In the drying-time example on page 194, the choice of H), namely, 
the choice of the hypothesis that 4 = 20 minutes, was dictated by the 
paint manufacturer’s claim; on the other hand, the choice of the alter- 
native hypothesis u = 21, for which we calculated the probability of a 
Type II error, was essentially arbitrary. Thus, it might be of interest to 
see what happens for other values of yw. To this end, we shall investigate 
the probability of accepting the hypothesis H, for any given value of y, 
denoted L(z), and it should be observed that this is the probability of 
committing a Type II error for all values of yu which should be rejected, and 
the probability of not committing a Type I error for all values of which 
should be accepted. 


Example. Continuing with the drying-time problem on page 196, 
where we had yu, = 20, o = 2.4, n = 36, and the dividing line of the 
criterion was C = 20.75, it can easily be verified that the methods 
used on page 197 yield the results shown in the following table and in 
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Figure 7.6 (see also Exercise 5 on page 208): 


Probability of 
Value of yu accepting Ho 
19.50 0.999 
19.75 0.99 
20.00 0.97 
20.25 0.89 
20.50 0.73 
20.75 0.50 
21.00 ~-0;2T- 
21.25 0.11 
21.50 0.03 
21.75 0.01 
22.00 0.001 


Note that the probability of committing a Type II error diminishes 
when yz is increased, and that the probability of not committing a 
Type I error approaches 1 when yu becomes smaller and smaller than 
20.00. 


= 


ae 





Figure 7.6. Operating characteristic curve. 
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The graph of L(y) for various values of , shown in Figure 7.6, is called 
the operating characteristic curve or, simply, the OC curve of the test 


criterion. Ideally, we should want to reject the hypothesis H, that u = My 
in favor of the alternative hypothesis H, that uw = “, > My when actually 
ut exceeds flo, and to accept it when actually yw is less than or equal to fp. 
Thus, the “ideal” OC curve for our example would be given by the heavy 
lines of Figure 7.6. In actual practice, OC curves can only approximate 
such an “ideal” curve, with the approximation becoming better as the 
sample size is increased. To illustrate this point, the reader will be asked 
to indicate in Exercise 6 on page 208 how the OC curve of the drying-time 
example would have been “improved” if the sample size had been n = 64, 
while keeping the values of “y, o, and @ fixed. 

Figure 7.6 presents the picture of a typical OC curve for the case where 
the alternative hypothesis is “4, > U). When the alternative hypothesis is 
Ly < fo, the OC curve becomes the mirror image of that of Figure 7.6, 
reflected about the (dashed) vertical line through y,. For a two-tail test, 
when the alternative hypothesis is uw 4 My, the OC curve will be as shown 
in Figure 7.7, and it should be observed that L(y) now is the probability 





Figure 7.7. Operating characteristic curve for two-tail test. 


of a Type II error for all values of wu except ; at Mp it is still the prob- 
ability of mot committing a Type I error. Due to the symmetry of this 
kind of OC curve about the (dashed) vertical line through sp, all problems 
relating to Type II errors can be answered on the basis of the values of 
L(u) for hu > Lo. 

In many practical situations, the probabilities of Type I errors can 
be determined directly from charts like those given in Table 8 at the 
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end of the book. They enable us to read off the value of B which cor- 
responds to given values of yo, 4,, 0,”, and a, and they are based on the 
assumption that the sampling distribution of x is a normal distribution; 
hence, they can be used when n is large or the population from which we 
are sampling has roughly the shape of a normal distribution. Charts (a) 
and (b) apply to one-tail tests with « = 0.05 or « = 0.01, which are the 
two values most often used in actual practice. They apply regardless of 
whether the alternative hypothesis is u > WU, Or LL < Lp, since the quantity 
which we plot on the horizontal scale is not yw (as in Figure 7.6) but 


da l= bol 
oO 


namely, the absolute value of the difference between yw and y, divided by 
o. The following are some examples which illustrate this method; note 
that we could not have used it for the problem dealing with the drying 
times since « equalled 0.03. 


Example. Suppose we want to investigate the claim that the sound 
intensity of certain vacuum cleaners may be looked upon as a random 
variable having a normal distribution with a mean of 75.20 decibels 
and a standard deviation of 3.6 decibels. Specifically, we shall want 
to test the hypothesis “) = 75.20 against the alternative hypothesis 
dt > 75.20 on the basis of measurements of the sound intensity of 
n = 15 of these machines; the probability of a Type I error is to be 
a% = 0.05. Using the formula on page 198, we find that the hypothesis 
Ly = 75.20 will be accepted so long as the mean of the sample does 
not exceed 
o) 3.6 
C= Uy + 24° a 75.20 + 1.645 - Ti 76.73 

and it may be of interest to know the probability that we would, thus, 
commit a Type II error when actually the mean sound intensity of the 
vacuum cleaners is 77.00 decibels. Since 


_ |77.00 — 75.20| __ 
d= ee. eee 0.50 
we mark this point on the horizontal scale of chart (a) of Table 8, 
and, going up vertically until we come to the point where the line is 
crossed by the OC curve for n = 15, we find that the corresponding 
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probability of a Type II error is about 0.39. Proceeding in the same 
way, it can also be shown that when uw = 76.00 decibels the prob- 


ability of a Type II error is about 0.76, and when uw = 78.00 decibels 
the probability of a Type II error is about 0.09. 


Example. To give an example where the alternative hypothesis is 
ft < fo, Suppose we want to check the claim that in city traffic the 
mileage a person will get per tankful with a certain kind of car may be 
looked upon as a random variable having a normal distribution with 
a mean of 15.0 miles per gallon and a standard deviation of 0.8 mile 
per gallon. Specifically, we shall want to test the hypothesis uz, = 15.0 
against the alternative hypothesis “4 < 15.0 on the basis of the mileage 
obtained in city traffic with a tankful of gas by nm = 10 of these cars; 
the probability of a Type I error is to be w« = 0.01. Using the formula 
for C on page 198 with —z, substituted for z,, we find that the hypo- 
thesis 4) = 15.0 will be accepted so long as the mean of the sample 
is not less than 


Ca(s0= 9332" 


af 10 


and it may be of interest to know the probability that we would, thus, 
commit a Type II error when actually the average mileage per tankful 
is 14.0 miles per gallon. Since 


f=! 25 


14,0 — 15.0| _. i 
0.8 

we mark this point on the horizontal scale of chart (b) of Table 8, 
and, going up vertically until we come to the point where the line is 
crossed by the OC curve for n = 10, we find that the corresponding 
probability of a Type II error is about 0.06. Proceeding in the same 
way, it can also be shown, for example, that when uw = 14.5 miles per 
gallon, the probability of a Type II error is about 0.64. 


Table 8 can also be used to solve problems like the one posed on page 
197, namely, that of determining the sample size which is required to 
test the hypothesis w= fy against the alternative hypothesis uw = y, 
while « and f take on preassigned values. 


Example. Referring again to the example on page 198, where we had 
Mo = 20, UW, = 21,0 = 2.4, 4 = 0.05, and # = 0.10, we first calculate 


qa lB! = 0.42 
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Then, locating the point corresponding to d = 0.42 and Bf = 0.10 on 
chart (a) of Table 8, we find that it lies barely above the OC curve 
for n = 50. Thus, a sample of size n = 50 will serve the stated purpose, 
and this agrees with the result obtained on page 198. 


For two-tail tests, that is, for tests of the hypothesis w = My against 
the alternative hypothesis 4 ~ fo, the procedure is the same as before, 
with the exception that we must now refer to charts (c) and (d) of Table 
8. Note that these charts contain only the right-hand halves of the 
respective OC curves (as shown in Figure 7.7 on page 203), since they are 
symmetrical about Lp. 


Example. To illustrate, let us verify the value of # obtained in the 
example on page 200 (dealing with the diameters of the steel pipe), 
where we wanted to test the hypothesis 4 = 3.0000 inches against 
the alternative hypothesis “4 + 3.0000 inches, with 0 = 0.0250, n = 
30, and « = 0.05. To determine the probability of a Type II error 
when yu = 3.0050 inches, we first calculate 


__ |3.0000 — 3.0050] _ 
d= 99250 = ee 


and mark this point on the horizontal scale of chart (c) of Table 8. 
Then, going up vertically until we come to the point where the line is 
crossed by the OC curve for n = 30, we find that the corresponding 
probability of a Type II error is just about 0.80. 

To continue with this example, suppose we want to know by how 
much we would have to increase the sample size so that # (for uw = 
3.0050) would be reduced to 0.50. Locating the point corresponding 
to d= 0.20 and £ = 0.50 on chart (c) of Table 8, we find that it 
lies just about on the OC curve for n = 75. Thus, it would suffice to 
increase the sample size from n = 30 ton = 75. 


The purpose of the discussion of this and the preceding sections has 
been to introduce some of the basic problems connected with the 
testing of statistical hypotheses. Although the methods we have presented 
ate objective—that is, two experimenters analyzing the same data 
under the same conditions would arrive at the identical results—their 
use does entail some arbitrary, or subjective, considerations. For instance, 
in the example on page 194 it was partially a subjective decision to “draw 
the line” between satisfactory and unsatisfactory values of uw at 21 minutes. 
It was also partially a subjective decision to use a sample of 36 one-gallon 
cans of the paint, and to reject the manufacturer’s claim for values of 
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x exceeding 20.75 minutes. Approaching the problem differently, the 
government agency investigating the paint manufacturer’s claim could 
have specified values of a and f, thus controlling the risks to which they 
are willing to be exposed. The choice of «, the probability of a Type I 
error, could have been based on the consequences of making that kind 
of error, namely, the manufacturer’s cost of having a good product con- 
demned, the possible cost of subsequent litigation, the manufacturer’s 
cost of unnecessarily adjusting his machinery, the cost to the public of 
not having the product available when needed, and so forth. The choice 
of £, the probability of a Type II error, could similarly have been based 
on the consequences of making that kind of error, namely, the cost to 
the public of buying an inferior product, the manufacturer’s savings in 
using inferior ingredients but loss in good will, again the cost of possible 
litigation, and so forth. It should be obvious that it would be extremely 
difficult to put “cash values” on all these eventualities, but they must 
nevertheless be considered, at least indirectly, in choosing suitable criteria 
for testing statistical hypotheses. 

In recent years, attempts have been made to incorporate all these 
matters within a formal theory called decision theory. Although many 
important advances have been made, it should be recognized that such a 
theory does not eliminate the arbitrariness, or subjectiveness, discussed 
above; it merely incorporates these matters within the theory. This means 
that the use of decision theory requires that we actually put cash values 
on all possible consequences of our decisions. Although this has the advan- 
tage that it makes the experimenter (the engineer) more “cost conscious,” 
it also has the disadvantage of requiring information which very often 
cannot be obtained. 

In this text we shall discuss mainly the Neyman-Pearson theory, also 
called the classical theory of testing hypotheses. This means that we shall 
consider cost factors and other considerations that are partly arbitrary 
and partly subjective only insofar as they will affect the choice of a sample 
size, the choice of an alternative hypothesis, the choice of w and f, and so 
forth. 


> E E 
- XERCISES 


1. If the criterion on page 196 is modified so that the paint manufacturer’s 
claim is rejected for x > 20.50 minutes, find 
(a) the probability of a Type I error; 
(b) the probability of a Type II error when = 21 minutes. 
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2. 


Suppose that in the gasoline mileage problem on page 205, where we had 
{Mo = 15.0 miles per gallon, the alternative hypothesis w < 15.0, 0 = 0.8, 
and n = 10, the hypothesis ~ = 15.0 is rejected for x < 14.2. Find 

(a) the probability of a Type I error; 

(b) the probability of a Type II error when yu = 14.0. 


yim that the area of the shaded region of Figure 7.5 is approximately 0.80. 


. Suppose that in the problem on page 199 dealing with the manufacture of 


5. 


steel pipe, where we had My) = 3.0000 inches, the alternative hypothesis 
jt € 3.0000 inches, o = 0.0250 inch, and nm = 30, the hypothesis Mo = 
3.0000 is rejected if x is less than C, = 2.9930 or greater than C, = 3.0070. 
Find 

(a) the probability of a Type I error; 

(b) the probability of a Type IJ error when ww = 3.0050. 







With reference to the drying-time example, verify the values of the OC curve 
given in the table on page 202. 


Suppose that in the drying-time example on page 196, is changed from 36 

to 64, while the other quantities remain My) = 20, 0 = 2.4, and & = 0.03. 

Find 

(a) the new dividing line of the test criterion; 

(b) the probabilities of Type II errors for the values of y in the table on page 
202, and plot the graph of the corresponding OC curve. 

Compare the OC curve obtained here with the one given on page 202. 


7.) It is desired to test the hypothesis 4 = 0 against the alternative “4 > 0 on 


ns! 


j 
y 
UL 


\ af 


the basis of a random sample of size 9 from a normal population with the 
variance o2 = 1. If the probability of a Type I error is to be & = 0.01, 

(a) verify that the region of rejection is x > 0.78; 

») calculate B for uw = 0.5, 1.0, and 1.5, and draw a rough sketch of the OC 
/ curve. 

‘Also, use Table 8 to verify the values obtained for f in part (b). 






It is desired to test the hypothesis 4) = 40 against the alternative hypothesis 
[1 = 42 on the basis of a random sample from a normal population with 
the standard deviation o = 4. If the probability of a Type I error is to be 
0.05 and the probability of a Type II error is to be 0.24, find the required 
size of the sample by using 

(a) Ahe formula on page 198; 

(by Table 8. 





Ftd a 
4 OL 
fy 
ee 


9*It is desired to test the hypothesis 4) = 100 against the alternative hypothe- 


sis £4 100 on the basis of a random sample from a normal population with 
the standard deviation o = 16. If the probability of a Type I error is to be 
0.01 and the probability of a Type II error is to be 0.20 when yu = 92, find 
the required size of the sample by using 

(a) the formula on page 198; 

(b) Table 8. 
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fo; If a random sample of size n = 6 is used to test the hypothesis that the mean 
of a normal population with o =4 is WM) = 50 against the alternative 
hypothesis that uw > 50, and & =0.01, use Table 8 to determine the 
probability of committing a Type II error when 


(a) w = 52: (b) w = 53; 
(c) w = 54; (d) w = 55; 
(ce) w= 56; (f) w= 57. 


11. Repeat Exercise 10, using the alternative hypothesis 44 + 50 instead of the 
alternative hypothesis “Z > 50. 


e If arandom sample of size n = 8 is used to test the hypothesis that the mean 

of a normal population with @ = 20 is fy = 200 against the alternative 
hypothesis “4 ~ 200, and & = 0.05, use Table 8 to determine the prob- 
ability of committing a Type II error when 


(a) w= 190; (b) w= 185; 
(c) “= 180; (d) w= 175; 
(e) uw = 170. 


13. Repeat Exercise 12, using the alternative hypothesis 4 < 200 instead of the 
alternative hypothesis “4 ~ 200. 


7.6 NULL HYPOTHESES AND 
SIGNIFICANCE TESTS 


There are many situations in which the hypothesis w= yl, is chosen 
in such a way that we are willing to “reserve judgment” about its validity 
unless there is clear evidence that leads to its rejection. Thus, set up as a 
“straw man” with the objective of determining whether or not it can be 
rejected, we refer to such an hypothesis H, as a null hypothesis. 

The idea of setting up a null hypothesis is not an uncommon one, even 
in nonstatistical thinking. In fact, this is exactly what is done in an Amer- 
ican court of law, where an accused is assumed to be innocent unless 
he is proven guilty “beyond a reasonable doubt.” The null hypothesis 
states that the accused is not guilty, and the probability expressed sub- 
jectively by the phrase “beyond a reasonable doubt” reflects the probability 
a of risking a Type I error. Thus, the “burden of proof” is always on the 
prosecution in the sense that the accused is found not guilty unless the 
null hypothesis of innocence is clearly disproved. Note that this does not 
imply that the defendant has been proved innocent if found not guilty; 
it implies only that he has not been proved guilty. Of course, since we 
cannot legally “reserve judgment” if proof of guilt is not established, the 
accused is freed and we act as if the null hypothesis of innocence were 
accepted. Note that this is precisely what we may have to do in tests of 
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statistical hypotheses, when we cannot afford the luxury of reserving 
judgment. 

To establish a parallel between this argument and the kind of practical 
problem in which tests of null hypotheses are ordinarily applied, let us 
consider the following examples: 


Example. Suppose that a decision has to be made whether to buy a 
new automatic stamping machine, and it is felt that the machine 
would be economical only if uw, the average number of acceptable 
pieces produced per hour, were greater than 1,000. Thus, we have a 
situation which calls for a test of the null hypothesis w = 1,000, but 
we cannot tell immediately whether to use the alternative hypothesis 
f& < 1,000 or the alternative hypothesis w > 1,000. If the burden 
of proof is to be placed on the rather expensive machine, it would be 
appropriate to use the alternative hypothesis uw > 1,000, installing the 
machine only if the null hypothesis can be rejected on the basis of sample 
data. Note that if « = 0.05 for this test, there is only a 5 percent chance 
of erroneously rejecting the null hypothesis and installing the new 
stamping machine. On the other hand, if the burden of proof is to be 
placed on the existing machine (which, for some other reason, is 
looked upon with disfavor), it would be appropriate to use the alter- 
native hypothesis “ < 1,000, installing the machine so long as the null 
hypothesis cannot be rejected on the basis of sample data. In this case, 

— 0.05 would be the probability of erroneously rejecting the null 
hypothesis and not installing the new machine. The choice of the 
appropriate alternative hypothesis in this and in similar situations 
is a practical rather than a statistical problem; as we have indicated, 
it depends on where we wish to place the burden of proof. 


Example. To give an example in which we would use the alternative 
hypothesis ww ~ My, where fy is the value assumed under a null 
hypothesis, suppose that a food processor wants to check whether 
the correct amount of instant coffee goes into his 4-ounce jars. Since 
the label reads “4 ounces,” the food processor cannot afford to put 
much less than 4 ounces into each jar for fear of losing customer 
acceptance or running afoul of the law, nor can he afford to put much 
more than 4 ounces in each jar for fear of losing part of his profit. Thus, 
the food processor is concerned with the alternative hypothesis 
yu # 4, and the production process will be left undisturbed unless the 
null hypothesis w= 4 can be rejected on the basis of sample data. 


Tests of the null hypothesis w= “4, may reasonably be described as 
tests of whether x is significantly less than Ly, significantly greater than 
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Ho, or significantly different from fl), depending on the alternative hypoth- 
esis with which we are concerned. By “significantly less,” “significantly 


greater,” and “significantly different” we mean that the discrepancy 
between x and sy (namely, the difference between what we get and what 
we expect) is too large to be reasonably attributed to chance. Thus, we 
refer to a test of a null hypothesis as a test of significance, or a significance 
fest. 


Example. Suppose we look at the example on page 204 as a signifi- 
cance test of the null hypothesis that the average sound intensity 
of the vacuum cleaners is “ = 75.20 decibels, and that the alternative 
hypothesis is again w > 75.20. Then, if a random sample of size 15 
has the mean x = 77.12, which exceeds C = 76.73 (the “dividing 
line” obtained on page 204), we say that the difference between x = 
77.12 and Uy = 75.20 is significant when « = 0.05; in other words, 
we conclude that the null hypothesis “) = 75.20 will have to be 
rejected. Note, however, that if we had been willing to risk a Type I 
error with a probability of « = 0.01 instead of 0.05, the dividing line 
of the criterion would have been 


Cs 150 0994 


eat: 


and we would have had to conclude that the difference between x = 
77.12 and My = 75.20 is not significant when « = 0.01. 


THis illustrates the fact that the significance of a difference between x and 
[fo may well depend on @, the probability of a Type I error, which in this 
connection is referred to as the Jevel of significance. Thus, in the above 
example, the null hypothesis “4. = 75.20 decibels can be rejected at the 
level of significance « = 0.05, but vot at the level of significance « = 0.01. 
Whether we choose « = 0.05, 0.01, or some other value, in any given prob- 
lem, will have to depend on the risks, or consequences of committing a Type 
I error. 


7.7 WYPOTHESES CONCERNING ONE MEAN 


Let us now examine the general problem of testing the null hypoth- 
esis Ld = Ly against one of the one-sided alternatives u< My OF L> Ly; 
or against the two-sided alternative & ~ Uo. Considering first the case 
where we are sampling from a normal population and a is known, or the 
sample is large, so that the sampling distribution of x is approximately 
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a normal distribution and o can be estimated in terms of the sample 
standard deviation, we find that this problem has already been solved in 
Section 7.4. Given fo, o, , and @, the critical region is X > C when the 
alternative hypothesis is “ > fy, X < C’ when the alternative hypothesis 
Is f@< My, and x <C, or X > C, when the alternative hypothesis is 
jt ~A fy, Where the formula for C is on page 198, the formula for C’ is 
like the one for C with —z, substituted for z,, and the formulas for C; 
and C,, are on page 199. An equivalent, but simpler, method of specifying 
the critical region is to base it on the statistic 





instead of x. If z, is, as before, such that the area under the standard 
normal curve to its right equals «, the critical regions for testing the null 
hypothesis H,, namely, the hypothesis w= fo, can be expressed as in 
the following table: 


CRITICAL REGIONS FOR TESTING Ho: “ = Uo 
(Normal Population and o Known, or Large Sample) 


Alternative 
hypothesis Reject Ho if 


L< Lo Z< Zz 


> Lo Ze Bey 


Z, =. —Zq,/2 


LF Ho 





Example. To illustrate, let us return to the problem concerning the 
automatic stamping machine. The null hypothesis is ~ = 1,000 and, 
putting the burden of proof on the new machine, we shall use the 
alternative hypothesis “> 1,000. Suppose, furthermore, that the 
decision is to be based on a sample of n = 64 independent trials (each 
of which contains one hour’s production), and that the mean and 
the standard deviation of the number of acceptable pieces per trial 
are, respectively, x = 1,028 and s = 126. Although o is not actually 
known, the sample is large enough to approximate it with s = 126, 
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and we thus obtain 


1,028 — 1,000 
Soe ie 
* ~ ~"126/,/64 


If the level of significance is to be w = 0.05, we find from Table 3 
that the dividing line of the criterion, the critical value, is Z)55 = 
1.645, and since z = 1.78 is greater than 1.645, we find that the null 
hypothesis can be rejected at the 0.05 level. Thus, we would decide 
that the new machine is to be installed. In Exercise 4 on page 222 the 
reader will be asked to determine what action would be taken if we 
had used «& = 0.01 instead, and the probability of the Type II error we 
would be risking with this criterion when actually uw = 1,050 for the 
automatic stamping machine. “xe © 


If the sample size is small and o is unknown, the tests just described 
cannot be used. However, if the sample comes from a normal population 
(to within a reasonable degree of approximation), we can make use of 
the theory discussed in Section 6.3 and base the test of the null hypothesis 
[= Ly On the statistic 





The resulting critical regions are as shown in the following table, where ¢, 
is as defined on page 174; that is, the area to its right under the ¢ distribu- 
tion with n — | degrees of freedom is equal to a: 


CRITICAL REGIONS FOR TESTING Ho: = Lo 
(Normal Population, o Unknown) 


Reject Ho if for 


Alternative n — 1 degrees 
hypothesis of freedom 


ji = Mo rs te 


UF Lo 
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Example. To illustrate, let us reconsider the problem of deciding 
whether changes have to be made in the machinery that fills the jars 
of instant coffee (see page 210), namely, the problem of testing 
the null hypothesis uw = 4 ounces against the alternative hypothesis 
wu ~ 4 ounces. Suppose that the level of significance is to bea = 0.01, 
and that the net weights of the contents of a sample of 25 jars have 
a mean of x = 3.97 and a standard deviation of s = 0.04 ounce. To 
decide whether the machinery will have to be adjusted, we calculate 


3.97 — 4 

t= 0.04),/25 == + 3.15 
and since this is less than —2.797, the value of —fp.995 with 24 degrees 
of freedom (see Table 4), the null hypothesis will have to be rejected. 
Note that in spite of this result, the food processor may not wish to 
adjust his machinery, since the actual loss due to underfilling the jars 
by a very small amount may be less than the cost of experimenting 
with adjustments. This illustrates the important point that a result 
which is statistically significant may not be commercially significant. 
Such factors are considered in decision theory (see page 207), and 
under the circumstances it might be more appropriate to test the null 
hypothesis 4 = 4 against an alternative such as w < 3.95 or uw > 4.05, 
if it is felt that in either case there is cause for an adjustment of the 
machinery. 


Had the sample data yielded x = 4.02 and s = 0.04 in this example, 
t would have equalled 2.50, the null hypothesis could not have been 
rejected, and the food processor would have risked committing a Type II 
error by not adjusting his machinery. The probabilities of committing 
Type II errors when using the ¢ statistic are difficult to calculate, but 
there exist special charts, analogous to those of Table 8, which give 
the OC curves for various values of n; they may be found, for example, 
in National Bureau of Standards Handbook 91, listed in the Bibliography 
at the end of this book. 


7.8 HYPOTHESES CONCERNING TWO 
MEANS 


When dealing with population means, we are frequently faced with 
the problem of making decisions about the relative values of two or 
more means. Leaving the general problem until Chapter 12, we shall 
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devote this section to tests concerning the difference between two means. 
For example, if two methods of welding are being considered for use 
with railroad rails, we may take samples and decide which is better by 
comparing their mean strengths; also, if a licensing examination is given 
to engineers who graduated from two different colleges, we may want to 
decide whether any observed difference between the means of the scores 
of the students from the two colleges is significant or whether it may be 
attributed to chance. 

Formulating the problem more generally, we shall consider two 
populations having the means mu, and yw, and the variances a? and a2, 
and we shall want to test the null hypothesis u4, — uw, = 6, where O is 
a specified constant, on the basis of independent random samples of size 
n, and n,. Analogous to the tests concerning one mean, we shall consider 
tests of this null hypothesis against each of the alternatives uw, — u, < 0, 
i; — Hl, > 0, and “, — UW, 40. The test, itself, will depend on the 
difference between the sample means, x, — x,, and if both samples come 
from normal populations with known variances (or both samples are 
large), it can be based on the statistic 


ps a) =O 


O2z,-Ze 


whose sampling distribution is (or is approximately) the standard normal 
distribution. Here o;,_;, is the standard deviation of the sampling dis- 
tribution of the difference between the sample means, and its value for 
random samples from infinite populations may be obtained with the use 
of the following theorem, which we shall state without proof: 


THEOREM 7.1. If the distributions of two independent random vari- 
ables have the means 1, and 1, and the variances a7 and a}, then the 
distribution of their sum (or difference) has the mean ML, + [, (or 
LL, — [,) and the variance o% + 0}. 


To find the variance of the difference between the means of two 
independent random samples of size n, and n, from infinite populations, 
note first that the variances of the two means, themselves, are 


2 2 

o or 
o2,=— and o?,=— 
ny N2 


where a? and a are the variances of the respective populations. Thus, by 
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Theorem 7.1 


2 ia Og 
Ox,—-%: a a ee 


and the test statistic can be written as 


/_Gi-%,)—6 
2 2 
Oi | G2 
Nn, Ny 


Analogous to the table on page 212, the critical regions for testing the 
null hypothesis “; — MW, = 6 are as follows: 


CRITICAL REGIONS FOR TESTING Ho: “1 — 2 = 0 
(Normal Populations with 0; and ¢2 Known, or Large Samples) 


Alternative 
hypothesis Reject Ho if 
Hi — 2 > Z> Zo 
<< ——Zaeld 
= noe re) Zz Ot 


Example. To illustrate this procedure, suppose that a company 
claims that its light bulbs are superior to those of its main competitor 
on the basis of a study which showed that a sample of n, = 40 of its 
bulbs had an average “lifetime” of 647 hours of continuous use with 
a standard deviation of 27 hours, while a sample of n, = 40 bulbs 
made by its main competitor had an average “lifetime” of 638 hours of 
continuous use with a standard deviation of 31 hours. If we wish to 
test at the 0.05 level of significance whether the observed difference of 
9 hours between the two sample means is significant or whether it can 
be attributed to chance, placing the burden of proof on the claim, 
the appropriate null and alternative hypotheses are, respectively, 
Mk, — UM, =O and uw, — “, > 0. Accordingly, we put 6 = 0 in the 
formula for z and the test statistic becomes 


, — 647 — 638 
27 30 
40 * 40 


= 1.38 
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(Note that we have approximated the population variances with the 
variances of the respective samples, which is justifiable since both 
samples are fairly large.) Since the value which we obtained does not 
exceed the critical value z, >; = 1.645, we find that the null hypothesis 
cannot be rejected; thus, we conclude that the observed difference 
between the two sample means is not significant at the level of signifi- 
cance « = 0.05 or, in other words, that it can well be attributed to 
chance. 


If we actually wanted to accept the null hypothesis that the average 
lifetimes of the two kinds of light bulbs are equal, we would be risking 
Type II errors, for which the probabilities would depend on the actual 
differences 6’ = 4, — pL, between the two population means. Fortunately, 
these can be determined with the use of Table 8 (so long as we are 
sampling from normal populations with known standard deviations or 
both samples are large). The quantity which we mark on the horizontal 
scale is 


ga ld =o" 
/o% + o3 





where 6 is the value of “4, — , assumed under the null hypothesis and 6’ 
is the alternative value of “, — “, with which we are concerned. If 
nN, =n, =n, the probability of a Type IJ error is read off the OC curve 
corresponding to this value of n; if n, ~n,, it is read off the OC curve 
corresponding to 


y= Cio 
Z 2 

01 ts 02 

ny No 


Continuation of Example. Suppose we are concerned about the 
possibility of not rejecting the null hypothesis 6 = 0, when actually 
the average lifetime of the first kind of light bulb exceeds that of the 
second kind of light bulb by 6’ = 16 hours. Substituting into the 
formula for d, we obtain 


7 0 = 16) 


ee aa de Yc 
27 + 31? un 
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and, marking this point on the horizontal scale of chart (a) of Table 
8 and, going up vertically until we come to the OC curve for n = 40, 
we find that the corresponding probability of a Type II error is approxi- 
mately 0.20. 


If either (or both) samples are small and the population variances are 
unknown, we can base tests of the null hypothesis uw, — u,=06 ona 
suitable ¢ statistic, provided it is reasonable to assume that both populations 
are normal with o, = o,. Under these conditions it can be shown that the 
sampling distribution of the statistic 


» = (41 — 2) — 0 


Sx - Xp 


is the ¢ distribution with n, + n, — 2 degrees of freedom. In this formula 
the denominator involves a “pooled estimate” of the population variance. 

To clarify what we mean here by a “pooled estimate” of the popula- 
tion variance, let us first consider the problem of estimating the variance 
of the distribution of the difference between two sample means. Under 
the assumption that oj = 0} (= a”), this variance is given by 


2 2 
os C4 Oz af 1 I ) 
3, = 24 Ba o(— + = 
; ny + No (5 r Ny 
and we now estimate oa” by “pooling” the two sums of squared deviations 
from the respective sample means. In other words, we estimate o? by 
means of 


Sp DOG = a) = D5 Oe — De 
n, +n, —2 Ry “tg — 2 


where > (x, — X,)* is the sum of the squared deviations from the mean for 
the first sample, while >) (x, — X,)* is the sum of the squared deviations 
from the mean for the second sample. We divide by n, + n, — 2, since 
there aren, — 1 independent deviations from the mean in the first sample, 
n, — 1 in the second, and we thus have n, + n, — 2 independent devia- 
tions from the mean to estimate the population variance. Substituting this 
estimate of o? into the above expression for o%,_;, and then substituting the 
square root of the result into the denominator of the formula for f 
above, we finally obtain 


om (i; = Fa) 0 n1n2(n, + nN, — 2) 


JA(n, — Ist + Gn — 1)s3 Ny + Ny 
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for the statistic on which we shall base the test. The corresponding critical 
regions for testing the null hypothesis H,: “, — “, = 6 are as shown in 
the following table: 


CRITICAL REGIONS FOR TESTING Ho: My — Lz = 6 
(Normal Populations, ¢; = ¢2 = a, o Unknown) 


Reject Ho if for 
Alternative ny +n, —2 
hypothesis degrees of freedom 


Ly — U2 <6 t< —te 


iti =— a > 0 


ee 





Example. To illustrate this kind of test, let us consider the following 
measurements of the heat-producing capacity of the coal produced 
by two mines (in millions of calories per ton): 


Mine 1: 8,260, 8,130, 8,350, 8,070, 8,340 
Mine 2: 7,950, 7,890, 7,900, 8,140, 7,920, 7,840 


The means of these two samples are, respectively, 8,230 and 7,940, 
whose difference is quite large, but it remains to be seen whether it is 
significant, namely, whether it will enable us to reject the null hypoth- 
esis , — £4, =0 against the alternative hypothesis uw, — u, ~ 0, 
say, at the level of significance a = 0.05. Calculating first the two 
quantities (x, — 1)s? and (m, — 1)s%, we get 


>) (x, — ¥,)* = 30? + 1007 + 1207 + 1602 + 110? 
= 63,000 : 
and 
>) (x, — ¥2)* = 10? + 50? + 40? + 200% + 20? + 100? 
= 54,600 


Then, substituting these two quantities together with x, = 8,230, 
X, = 7,940, 6 = 0, n, = 5, and n, = 6 into the formula for ¢, we 
obtain / 
‘a (8,230 — 7,940) /5-6-9 
763,000 + 54,600 VII 


Since this exceeds 2.262 the value of ty).925 for 9 degrees of freedom, 
the null hypothesis will have jto be rejected; in other words, we con- 


= 4.19 


} 
} 
/ 
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clude that the average heat-producing capacity of the coal from the 
two mines is not the same. 


In this last example we arbitrarily went ahead and performed a two- 
sample t test, tacitly assuming that the population variances were equal. 
Fortunately, the test is not overly sensitive to small differences between 
the population variances, and the procedure used in this instance is quite 
justifiable. To be on safer grounds, however, we should first have tested 
whether the difference between the sample variances may be attributed to 
chance; a procedure for performing such a test will be given in Chapter 8. 

If the difference between the sample variances is large or if it is other- 
wise unreasonable to treat the population variances as being equal, we 
cannot use the two-sample ¢ test just described. However, there are several 
alternative methods that can be used instead, which do not require the 
assumption of equal population variances. In one of these, the paired- 
sample t test, we work with the differences of paired observations, one 
coming from each sample, and the pairing is random (unless the data are 
actually given as pairs, say, when we are dealing with the gasoline con- 
sumption of a number of cars before and after the installation of certain 
anti-pollution devices). Of course, if m, ~7,, some of the observations 
in the larger sample will have to be discarded. Using these differences, 
we then perform the one-sample ¢ test described in Section 7.7 to deter- 
mine whether their mean is significantly different from 0. 


Example. To determine the effectiveness of an industrial safety pro- 
gram, the following data were collected on the average weekly loss of 
man hours due to accidents in 12 plants during “one year before and 
one year after” the program was put into operation: 


37.1 and 28.0, 72.5 and 59.3, 26.6 and 24.7, 125.0 and 120.3 
45.8 and 46.2, 54.3 and 43.6, 13.2 and 15.4, 79.5 and 75.1 
12.6 and 18.3, 34.9 and 29.7, 39.3 and 35.0, 26.6 and 24.8 


The differences between these paired observations are 9.1, 13.2, 1.9, 
4.7, —0.4, 10.7, —2.2, 4.4, —5.7, 5.2, 4.3, and 1.8, and we shall want 
to check whether we can reject the null hypothesis w = 0 against the 
alternative hypothesis “4 > 0 (namely, that the industrial safety pro- 
gram is effective) at « = 0.05. Since the mean and the standard devia- 
tion of the differences are, respectively, x = 3.92 and s = 5,38, 
substitution into the formula for ¢t on page 213 with uw, = 0 yields 


_, 392 — 0 


Se es 
5.38/,/12 


t 
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Since this exceeds 1.796, the value of t,.,; for 11 degrees of freedom, 
we find that the null hypothesis will have to be rejected; in other 
words, we conclude that the industrial safety program is effective. 
Note, however, that if we had used the level of significance « = 0.01, 
we would not have been able to arrive at this conclusion, and this 
illustrates the important point that the level of significance should 
always be chosen before the data are analyzed in any fashion. 


Although this paired-sample ¢ test can be used when sampling from 
normal populations regardless of whether the samples are independent 
(and the pairing is random) or the population standard deviations are equal, 
it has two disadvantages. First, some of the data may have to be discarded 
when the sample sizes are unequal, and second, there is a serious loss of 
information in the sense that the test is performed as if there were only 
n observations instead of 2n observations. An alternate test which avoids 
these disadvantages when the samples are independent is given in Exercise 
22 below. 


EXERCISES 


1. A city’s police department is considering replacing the tires on its cars with 
radial tires. If 4; is the average number of miles they get out of their old 
tires and j/, is the average number of miles they will get out of the new tires, 
the null hypothesis they shall want to test is “4; — M2 = 0. 

(a) What alternative hypothesis should they use if they do not want to buy 
the radial tires unless they are definitely proven to give a better mileage? 
In other words, the burden of proof is put on the radial tires and the old 
tires are to be kept unless the null hypothesis can be rejected. 

(b) What alternative hypothesis should they use if they are anxious to get 
the new tires (which have some other nice features) unless they actually 
give a poorer mileage than the old tires? Note that now the burden of 
proof is on the old tires, which will be kept only if the null hypothesis 
can be rejected. 


2. A producer of extruded plastic products finds that his mean daily inventory 
is 1,250 pieces. A new marketing policy has been put into effect and it is 
desired to test the null hypothesis that the mean daily inventory is still the 
same. What alternative hypothesis should be used if 
(a) it is desired to know whether or not the new policy changes the mean 

daily inventory; 
(b) it is desired to demonstrate that the new policy actually reduces the mean 
daily inventory; 
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(c) the new policy will be retained so long as it cannot be shown that it 
actually increases the mean daily inventory? 


» An oceanographer wants to check whether the average depth of the ocean in 


a certain region is 57.4 fathoms, as had previously been recorded. What can 
he conclude at the level of significance & = 0.05 if soundings taken at 40 
random locations in the given region yielded a mean of 59.1 fathoms with a 
standard deviation of 5.2 fathoms? 


. Referring to the example on page 212 and using s = 126 as an estimate of 6, 


but changing the level of significance from & = 0.05 to & = 0.01, what 
action would you now take? Find from Table 8 the probability of a Type 
If error when = 1,050. 


. A random sample of 6 steel beams has a mean compressive strength of 
‘ 58,392 psi (pounds per square inch) with a standard deviation of 648 psi. 


Use this information and the level of significance & = 0.05 to test whether 
the true average compressive strength of the steel from which this sample 
came is 58,000 psi. 


», The specifications for a certain kind of ribbon call for a mean breaking 
‘ strength of 175 pounds. If six pieces of the ribbon (randomly selected from 


different rolls) have a mean breaking strength of 172.8 pounds with a stan- 
dard deviation of 7.9 pounds, test the null hypothesis “4 = 175 against the 
alternative hypothesis 4 < 175 at the level of significance & = 0.05. 


. Ina labor-management discussion it was brought up that workers (employed 


in a certain large plant) take on the average 53.6 minutes to get to work. 

(a) Show that this figure is substantiated (that is, the null hypothesis uw = 
53.6 cannot be rejected against the alternative hypothesis uw ~ 53.6 at 
the level of significance & = 0.05) by a survey in which a random sample 
of 60 workers took on the average 54.8 minutes with a standard deviation 
of 8.1 minutes. | 

(b) Using s = 8.1 as an estimate of o and referring to Table 8, find the 
probability that the criterion used here might lead to a Type II error 
when actually it takes workers on the average 57.6 minutes to get to 
work. 


: Test runs with six models of an experimental engine showed that they 


operated, respectively, for 24, 28, 21, 23, 32, and 22 minutes with a gallon of 
a certain kind of fuel. Is this evidence at the level of significance « = 0.01 
that in general this engine will operate below a desired standard (average) of 
29 minutes per gallon? 


. A random sample from a company’s very extensive files shows that orders 


for a certain piece of machinery were filled, respectively, in 10, 12, 19, 14, 15, 
18, 11, and 13 days. Use the level of significance & = 0.01 to test the claim 
that on the average such orders are filled in 10.5 days. Choose the alternative 
hypothesis so that rejection of the null hypothesis 4 = 10.5 implies that it 
takes longer than indicated. 
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10. 


11. 


@ 


Tests performed with a random sample of 40 diesel engines produced by a 
large manufacturer showed that they had a mean thermal efficiency of 31.4 
percent with a standard deviation of 1.6 percent. 

(a) Use this information and the level of significance &% = 0.01 to test the 
null hypothesis 44 = 32.3 percent against the alternative hypothesis that 
the true average thermal efficiency of the manufacturer’s diesel engines 
is less than 32.3 percent. 

(b) Using s = 1.6 percent as an estimate of o and referring to Table 8, 
find the probability with which this test criterion would expose us to a 
Type II error when actually the true average thermal efficiency of the 
manufacturer’s diesel engines is 31.7 percent. 

(c) How large a sample should have been used in the first place, so that the 
probability of the Type II error asked for in part (b) would have been 
only 0.15? 


The diameters of rotor shafts in a lot have a mean of 0.249 inch and a stan- 

dard deviation of 0.003 inch. The inner diameters of bearings in another lot 

have a mean of 0.255 inch and a standard deviation of 0.002 inch. 

(a) What are the mean and the standard deviation of the clearances between 
shafts and bearings selected from these lots? 

(b) If a shaft and a bearing are selected at random, what is the probability 
that the shaft will not fit inside the bearing? (Assume that both dimen- 
sions are normally distributed.) 


An investigation of two kinds of photocopying equipment showed that 75 

failures of the first kind of equipment took on the average 83.2 minutes to 

repair with a standard deviation of 19.3 minutes, while 75 failures of the 

second kind of equipment took on the average 90.8 minutes to repair with a 

standard deviation of 21.4 minutes. 

(a) Test the null hypothesis “4, — “2 = 0 (namely, the hypothesis that on 
the average it takes an equal amount of time to repair either kind of 
equipment) against the alternative hypothesis “4, — {42 ~ 0 at the level 
of significance & = 0.05. 

(b) Using 19.3 and 21.4 as estimates of a; and o2 and referring to Table 
8, find the probability of accepting the null hypothesis “4, — 2, =0 

ith the criterion of part (a) when actually “, — M2 = —12. 


i: Rendon samples are taken from two normal populations with 0; = 10.8 


14. 


and o, = 14.4 to test the null hypothesis “4; — 42 = 53.2 against the alter- 
native hypothesis 4; — [2 > 53.2 at the level of significance & = 0.01. Use 
Table 8 to determine the common sample size n = n; = nz that is required 
if the probability. of accepting the null hypothesis is to be_0.09 when 
Mi — pe = 66.7. 7 


Measuring Samples of nylon yarn taken from two spinning machines, it was 
found that 8 samples from the first machine had a mean denier of 9.67 with 
a standard deviation of 1.81 while 10 samples from the second machine had 
a mean denier of 7.43 with a standard deviation of 1.48. Use these results to 
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1. 


16. 


1. 


18. 


test the null hypothesis “4; — “, = 1.5 against the alternative hypothesis 
Ly — U2 > 1.5 at the level of significance & = 0.05. Assume that the popu- 
lations are normal and have the same variance. 


Measurements of the fat content of two kinds of ice cream, Brand A and 
Brand B, yielded the following sample data: 


Brand A: 13.5, 14.0, 13.6, 12.9, and 13.0 percent 
Brand B: 12.9, 13.0, 12.4, 13.5, and 12.7 percent 


Test the null hypothesis “4; = M2 (where 4; and £2 are the respective true 
average fat contents of the two kinds of ice cream) against the alternative 
hypothesis “4; ~ [2 at the level of significance & = 0.05. 


Studying the flow of traffic at two busy intersections between 4 p.m. and 

6 P.M. (to determine the possible need for turn signals), it was found that on 

40 weekdays there were on the average 247.3 cars approaching the first 

intersection from the south which made left turns, while on 30 weekdays 

there were on the average 254.1 cars approaching the second intersection 
from the south which made left turns. The corresponding sample standard 

deviations are s; = 15.2 and s, = 18.7. 

(a) Test the null hypothesis “4; — 2 = 0 against the alternative hypothesis 
Hy — Lo ~ Oat the level of significance & = 0.01. 

(b) Using 15.2 and 18.7 as estimates of 0, and oa, and referring to Table 
8, find the probability if accepting the null hypothesis uw, — , =0 
when actually |“; — 2| = 15.6. 

To test the claim that the resistance of electric wire can be reduced by at 


least 0.050 ohm by alloying, 25 values obtained for each, alloyed wire and 
standard wire, produced the following results: 


Standard 

Mean deviation 
Alloyed wire 0.083 ohm 0.003 ohm 
Standard wire 0.136 ohm 0.002 ohm 


Use the level of significance & = 0.05 to determine whether the claim has 
been substantiated. 


As part of an industrial training program, some trainees are instructed by 
Method A which is straight teaching-machine instruction, and some are 
instructed by Method B which also involves the personal attention of an 
instructor. If random samples of size 10 are taken from large groups of 
trainees instructed by each of these two methods, and the scores which they 
obtained in an appropriate achievement test are 


Method A: 71, 75, 65, 69, 73, 66, 68, 71, 74, 68 
Method B: 72, 77, 84, 78, 69, 70, 77, 73, 65, 75 
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f 
i 


20. 


21. 


test the claim that Method B is more effective. Use the level of significance 
OG = 0,05. 


19.) The following data were obtained in an experiment designed to check wheth- 


er there is a systematic difference in the weights obtained with two different 
scales: | 
ya ge 
— 


Weight in grams 


a 
Scale I Scale II 
Rock specimen 1 11.23 11,27 
Rock specimen 2 14.36 14.41 
Rock specimen 3 8.33 8.35 
Rock specimen 4 10.50 10,52 
Rock specimen 5 23.42 23.41 
Rock specimen 6 9.15 9.17 
Rock specimen 7 13.47 13.52 
Rock specimen 8 6.47 6.46 
Rock specimen 9 12.40 12.45 
Rock specimen 10 19.38 19.35 


Use the level of significance & = 0.05 to test whether the difference of the 
means of the weights obtained with the two scales is significant. 


The following are the Brinell hardness values obtained for samples of two 
magnesium alloys: 


Alloy A: 66.3, 60.7, 63.5, 64.9, 61.8, 64.3, 60.2, 62.0, 
64.7, 73.6, 65.1, 64.5, 68.4, 63.2, 64.0 

Alloy B: 71.3, 65.4, 69.6, 62.6, 63.9, 68.8, 69.2, 65.3, 
70.1, 64.8, 68.9, 70.7, 71.1, 65.8, 66.2 


Randomly pair each value obtained for Alloy A with a different value obtain- 
ed for Alloy B, and use the method described on page 220, the paired-sample 
t test, to test the null hypothesis “44 = Mg against the alternative hypothesis 
La < [pat the level of significance & = 0.05. Explain why it would probably 
have been unreasonable to use the “standard” two-sample ¢ test of Section 
7.8 in this exercise. 


The following are the number of sales which a sample of 9 salesmen of indus- 
trial chemicals in California and a sample of 6 salesmen of industrial chemi- 
cals in Oregon made over a certain fixed period of time: 


California: 59, 68, 44, 71, 63, 46, 69, 54, 48 
Oregon: 50, 31, 62, 52, 70, 41 


Randomly pair each value of the second sample with a different value of the 
first sample, and use the paired-sample t test to test the null hypothesis 
Ho — [41 = O against the alternative hypothesis “4o — WW; ~ 0 at the level 
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of significance & = 0.01. Explain why it would probably have been unrea- 
sonable to use the “standard” two-sample ¢ test of Section 7.8 in this exam- 
ple. 


22. When dealing with two independent random samples from normal popu- 


gk 


23. 


24. 


lations whose variances are not necessarily equal, the following Smith- 
Satterthwaite test can be used to test the null hypothesis uw, — “2, = 0. The 
test statistic is given by 


»_ (% — Xy) oe 0 
Si 52 
Ji+ % 


and its sampling distribution can be approximated by the ¢ distribution with 


2 2 
(St ae 52)! 
ny nN2 


see r a )? 
1 ae 2. 7 


degrees of freedom. Use this test for the data of Exercise 20 and compare the 
answer with the one previously obtained. 


Use the formula for ¢ on page 218 to construct a 1 — & confidence interval 
for 0, the difference between the two population means. 


Use the formula obtained in Exercise 23 to construct a 0.95 confidence 
interval for the true difference between the average resistance of the alloyed 
and standard wires referred to in Exercise 17. 
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8.1 THE ESTIMATION OF VARIANCES 


There were several instances in the preceding chapter where we estimated 
a population standard deviation by means of a sample standard deviation 
as defined by the formula 


ae - [Reo (x, — 3) 
n=] 
We substituted s for o in the large-sample confidence interval for w on 
page 187, in the large-sample test concerning yw on page 212, and in the 
large-sample test concerning the difference between two means on page 
216. Since there are many statistical procedures in which s is substituted 


for a, or s? for a”, let us show first that s? is, in fact, an unbiased estimate of 
o*; namely, that the mean of the sampling distribution of s* equals a’. 


227 
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If f(x;, X2,...,%,) is the joint density of the sample values x;,x.,..., 
and x,, it follows from the discussion on page 129 that the mean of the 
sampling distribution of s? is given by 


i. {. 7 \_ POF CE y hay vg eg) ON ER 0h, 


=| {_ al 5 AY Phy Kan ey) do dea. ds, 


—0co J —00 —-o]= 


If we now write 
n 7 n 7 
> Gy; — x) = Do xt — nx? 
i=1 i=1 


and interchange the operations of summation and integration, the expres- 
sion for the mean of the distribution of s* becomes 


— 7h \_ i ea \_ fC Wek eo) Oc BG 





ae a \_ - - [fC x, Scena Mel Oy loose AX, 


Assuming without loss of generality that the population mean y is equal 
to zero, we find that these last two integrals have already been evaluated 
on pages 166 and 167, where it was shown that their respective values are 
o? and o2/n. Thus, the mean of the sampling distribution of s* is given by 
n o” no? fo} 


n 
2 — 2 
oOo” — :_— = — et 9 5 
4 n—1l1 in n— | n— | 


| 
n— |} 














and this completes the proof of the unbiasedness of s? as an estimate of o?. 
(Note that, had we divided by 7 instead of n — | in defining s?, the result- 
ing estimator would have been biased; the mean of its sampling distribu- 





tion would have been a e o?.) 


Although the sample variance is an unbiased estimator of o”, it does 
not follow that the sample standard deviation is also an unbiased estimator 
of a; in fact, it is not. However, for large samples the bias is small and it is 
common practice to estimate o with s. 

Besides s, population standard deviations are sometimes estimated in 
terms of the sample range R, which we defined earlier as the difference 
between the largest and the smallest values in a sample. Given a random 
sample of size n from a normal population, it can be shown that the sam- 
pling distribution of R has the mean d,o and the standard deviation do, 
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where d, and d; are constants which depend on the size of the sample, as 
shown in the following table: 


n ps 5 4 5 6 7 8 7 10 








dz 1.128 1.693 2.059 2.326 2.534 2.704 2.847 2.970 3.078 








d3 0.853 0.888 0.880 0.864 0.848 0.833 0.820 0.808 0.797 





Thus, the statistic R/d, is an unbiased estimator for o, and the standard 
deviation of its sampling distribution is given by d,0/d,. For very small 
samples (n < 5), R/d, provides nearly as good an estimate of o as does s, 
but as the sample size increases it becomes far more efficient to use s. The 
range is used to estimate o primarily in problems of quality control, where 
sample sizes are usually small and computational ease is an important 
requirement. This application will be discussed in detail in Chapter 14. 


Example. To illustrate the use of R as an estimate of a, let us refer to 
the example on page 219, which dealt with the heat-producing capacity 
of the coal from two mines. For the first mine, the smallest of the five 
sample values is 8,070, the largest is 8,350, so that the range is R = 
8.350 — 8,070 = 280, and the corresponding estimate of o 1S 


R 280 __ 
dq, = 2326 120.4 
Note that this is close to the standard deviation of this sample, which 


equals 


———$——__—_——_ 


Pi oo = 4955 





In Exercise 2 on page 232, the reader will be asked to verify that for the 
sample from the other mine the estimates of o based on Rand s are, 
respectively, 118.4 and 104.5. 


Interval estimates of o or a” are almost always based on the sample 
variance. Dealing with random samples from normal populations, we make 
use of Theorem 6.4, according to which 


(n — 1)s? 
o” 


is a value of a random variable having the chi-square distribution with 
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n — | degrees of freedom. Thus, if v7 and y% cut off left- and right-hand 
tails of area «/2 under the chi-square distribution with n — 1 degrees of 
freedom, we can assert with a degree of confidence of 1 — « that 


ee 2 
<2 <9 


Solving this inequality for o?, we obtain the following 1 — a confidence 
interval for o?: 





n — l)s? n— 1)s? 
( » Zeek 2) 


X2 Xi 





If we now take the square root of each member of this inequality, we obtain 
a corresponding 1 — @ confidence interval for o. Note that the above 
confidence intervals, obtained by taking “equal tails,” do not actually give 
the shortest confidence intervals for g? and a, because the chi-square dis- 
tribution is not symmetrical. Nevertheless, they are used in most applica- 
tions in order to avoid complicated calculations. 


Example. To illustrate the construction of a confidence interval for o 
(or o”), let us return to the example on page 176, and let us suppose 
that the refractive indices of 20 pieces of glass (randomly selected from 
a large shipment purchased by an optical firm) had a variance of 
1.20-10~*. To construct a 0.95 confidence interval for a, we find from 
Table 5 that for 19 degrees of freedom 


Xi = X7o1s = 8.907 and yx? = y4,,; = 32.852 


Substituting these values together with n = 20 and s? — 1.20-10-4 
into the above confidence interval formula for ¢?, we get 


CML 20 10) < 9 < 420 10-9 
or 
0.000069 < a? < 0.000256 
and, hence, 
0.0083 < ao < 0.0160 


The method which we have discussed applies only to random samples 
from normal populations(or at least to random samples from populations 
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sufficiently close to normal so that the method provides a good approxi- 
mation). If the sample size is large, it can be shown that for samples from 
distributions that are approximately normal, the sampling distribution of 
s can be approximated closely with a normal distribution having the mean 
o and the standard deviation o/,/2n. Hence, 


S—O 


7 @/,/2n 


is a value of a random variable having approximately the standard normal 
distribution, and solving the inequality 





S—oO 
a = < 2,2 


a/,/2n 


for a, we thus obtain the following 1 — « large-sample confidence interval 
for co: 





Example. To illustrate the use of this procedure, let us refer to the 
example on page 170, where 40 one-gallon cans of a certain kind of 
paint covered on the average 513.3 square feet with a standard devia- 
tion of 31.5 square feet. To construct a 0.99 confidence interval for o, 
we substitute n = 40, s = 31.5, and Zp 99; = 2.58 into the formula, 
getting 


15 ew NS 
1 4 258 , _ 258 
»/80 »/80 


and 
24.5< 0 < 44.3 


EXERCISES 


it Use the data of Exercise 8 on page 222 to estimate o for the number of 
minutes the experimental engine will operate with the given fuel 
(a) in terms of the sample standard deviation; 
(b) in terms of the sample range. 


232 Ch. 8 Inferences Concerning Variances 


Compare the two estimates by expressing their difference as a percentage of 


Ae first. 
. With reference to the example on page 229, show that for the sample from 
the second mine the estimates of o based on Rand s are, respectively, 118.4 
and 104.5. 


3. Use the data of Exercise 20 on page 225 to estimate o for the Brinel/ hardness 
of Alloy A 
(a) in terms of the sample standard deviation: 
(b) in terms of the sample range. 
ompare the two estimates by expressing their difference as a percentage of 
the first. 


. Use the information given in Exercise 6 on page 222 to construct a 0.95 
confidence interval for the standard deviation of the breaking strength of 
the given kind of ribbon. 


= With reference to Exercise 9 on page 222, construct a 0.99 confidence interval 
for the variance of the amount of time it takes to fill an order for a piece of 
e given kind of machinery. 


- While performing a strenuous task, the pulse rate of 25 workers increased on 
the average by 18.4 beats per minute with a standard deviation of 4.9 beats 
per minute. Find a 0.95 confidence interval for the corresponding population 
standard deviation, using 
(a) the small-sample technique based on the chi-square distribution; 

(b) the large-sample technique based on the normal distribution. 
Compare the two confidence intervals. 


7. Using the data of part (a) of Exercise 7 on page 222, construct a 0.99 confi- 
dence interval for the standard deviation of the amount of time it takes 
workers (employed at the given plant) to get to work. 


8/In the example on page 230 we indicated that the refractive indices of 20 
pieces of glass had a variance of 1.20-10-4. Using ,/1.20-10-4 = 0.0ll asa 
preliminary estimate of o and the large-sample theory about the sampling 
distribution of s on page 231, find the minimum sample size needed to esti- 
mate o with a maximum error of 0.0025 at a 0.95 degree of confidence. 


(9. using the value of s obtained in Exercise 3, construct a 0.98 confidence 
bal nterval for 7, measuring the actual variability in the hardness of Alloy A. 





46, If 32 measurements of the boiling point of sulfur hada mean of 444.8°C and 
““ a standard deviation of 0.83°C, construct a 0.99 confidence interval for the 
true standard deviation of such measurements. 


11. With reference to Exercise 10 on page 223, construct a 0.95 confidence 
interval for o, measuring the true variability of the thermal efficiency of such 
diesel engines. 
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8.2 HYPOTHESES CONCERNING ONE 
VARIANCE 


In this section we shall consider the problem of testing the null hypothe- 
sis that a population variance equals a specified constant against a suitable 
one-sided or two-sided alternative; that is, we shall test the null hypothesis 
ao” = o% against one of the alternatives g? < a3, o? > aj, or 07? 4a}. 
Tests like these are important whenever it is desired to control the unifor- 
mity of a product or an operation. For example, suppose that a silicon 
disc, or “wafer,” is to be cut into small squares, or “dice,” to be used in the 
manufacture of a semiconductor device. Since certain electrical character- 
istics of the finished device may depend on the thickness of the die, it is 
important that all dice cut from a wafer have approximately the same 
thickness. Thus, not only must the mean thickness of a wafer be kept 
within specifications, but also the variation in thickness from location to 
location on the wafer. 

Using the same sampling theory as in the preceding section, namely, 
the fact that for random samples from normal population with the vari- 
ance a2 


5. Ca Ts? 


x 


ory 





is a value of a random variable having the chi-square distribution with 
n — 1 degrees of freedom, we can use this y? statistic to test the null 
hypothesis o? = oj as shown in the following table: 


CRITICAL REGIONS FOR TESTING Ho: 02 = o% 
(Normal Population) 






Reject Ho if for 
n — 1 degrees 
of freedom 











Alternative 
hypothesis 





Y Sea = X4 0/2 
or x2 > 42/2 
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In this table v2 is as defined on page 176. Note that “equal tails” are used 
in performing the two-tail test, which is actually not the “best” procedure 
since the chi-square distribution is not symmetrical. 


Example. To illustrate this type of chi-square test, let us assume that 
the thicknesses of 15 dice cut from a silicon wafer have a standard 
deviation of 0.64 mil and that the lapping process which ground the 
wafers to the proper thickness is acceptable only if a, the population 
standard deviation of the dice thicknesses, is at most 0.50 mil. This 
means that we shall want to test the null hypothesis H):o0 = 0.50, 
against the alternative H,: 0 > 0.50, and we shall do so at a level of 
significance of 0.05. Since this is equivalent to testing the null hypothe- 
sis 0? = (0.50)? = 0.25 against the alternative a? > 0.25, the value of 
the test statistic becomes 


2» — THOS" = 22.94 


x 25 


which does not exceed 23.685, the value of v3); for 14 degrees of 
freedom. Thus, the null hypothesis cannot be rejected; even though the 
sample standard deviation exceeded 0.50, the evidence is not sufficient 
to arrive at the conclusion that the lapping process is unsatisfactory. 


There exist tables, like Table 8, which enable us to read off the prob- 
abilities of Type II errors connected with this kind of test. As given in the 
National Bureau of Standards Handbook 91 (see Bibliography at the end of 
the book), they contain the OC curves for the different one-sided and two- 
sided alternatives, for « = 0.05 and a = 0.01, and for various values of 7. 


The quantity which we mark on the horizontal scale is the ratio at, where 


0 
a, is the alternative value of o for which we want to determine the prob- 
ability of erroneously accepting the null hypothesis 0 = d». 
If the population from which we are sampling is not normal but the 
sample size is /arge (n > 30 is the usual rule of thumb), the null hypothesis 
o = 0, can be tested with the use of the statistic 


S — do 


7 Gola/2n 





whose sampling distribution is approximately the standard normal distri- 
bution. The only difference in the tests is that z and z, replace y” and yZ. 
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Example. The specifications for the mass production of certain 
bearings require, among other things, that the standard deviation of 
their diameters should not exceed 0.0040 cm. If a random sample of 
size n = 35, taken to test the null hypothesis o = 0.0040 against the 
alternative o > 0.0040 at the level of significance « = 0.01, yielded 
s = 0.0053 for the diameters of the bearings, we get 


__ 0.0053 — 0.0040 _ 179 
“=~ 0.0040/,/70. 
Since this exceeds Z) 9; = 2.33, we find that the null hypothesis will 
have to be rejected; the bearings do not meet the specifications with 
regard to the variability of their diameters. 


8.3 HYPOTHESES CONCERNING TWO 
VARIANCES 


The two-sample ¢ test for the difference between two means, described 
in Section 7.8, requires the assumption that the population variances are 
equal. Before proceeding with this test, therefore, it would be desirable to 
put this assumption to a test. In this section we shall describe a test of the 
null hypothesis H,: 07 = o% against an appropriate alternative, which 
applies to independent random samples from two normal populations. As 
we shall discover in Chapter 12, the test has many other important applica- 
tions. 

If independent random samples of size n, and n, are taken from normal 
populations having the same variance, it follows from Theorem 6.5 that 
the statistic 


hy 
| 

cae 

WN] bv 


is a value of a random variable having the F distribution with n, — 1 and 
n, — 1 degrees of freedom. Thus, if the null hypothesis oj = a} is true, the 
ratio of the sample variances s? and s3 provides a statistic on which tests 
of the null hypothesis can be based. 

The critical region for testing H, against the alternative hypothesis 
o? > ois F > F,, where F, is as defined on page 177, namely, it cuts off 
a right-hand tail of area « under the F distribution with n, — 1 andn, — 1 
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degrees of freedom. Similarly, the critical region for testing H) against the 
alternative hypothesis a? < a3 is F < F,_,, and this causes some difficul- 
ties since Table 6 only contains values corresponding to right-hand tails 
of « = 0.05 and « = 0.01. Asa result, we use the reciprocal of the original 
test statistic and make use of the relation 


PraglVys V2.) = | 


FV25 V1) 





first given on page 178. Thus, we base the test on the statistic F = s3/s{ 
and the critical region for testing Hj): oj = 03 against H,: 07 <a} becomes 
F > F,, where F, is the appropriate critical value of F withn, — 1 and 
n, — | degrees of freedom. 

For the two-sided alternative of ~ o} the critical region is F< Fy_,/. 
or F > F,,., where F = s{/sz and the degrees of freedom are n, — | and 
n, — 1. In practice, we modify this test as in the preceding paragraph, so 
that we can again use the table of F values corresponding to right-hand 
tails of ~« = 0.05 and w = 0.01. To this end we let s%, represent the larger 
of the two sample variances, s?, the smaller, and we write the corresponding 
sample sizes as my, and n,,. Thus, the test statistic becomes F = si,/s7, and 
the critical region is as shown in the following table: 


CRITICAL REGIONS FOR TESTING Ho: o? = o3 
(Normal Populations) 






Alternative 


hypothesis Test statistic Reject Ho if 










F> F,(n2 <= 1, 1 ai 1) 


> Fyn aie 1, 2 =i) 


F> Fyj2(t4m — 1, 4m — 1) 


The level of significance of these tests is « and the figures indicated in 
parentheses are the respective degrees of freedom. Note that, as in the 
chi-square test, “equal tails” are used in the two-tail test as a matter of 
mathematical convenience, even though the F distribution is not sym- 
metrical. 
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eae an example of a one-tail test for the equality of two 
variances, suppose that it is desired to determine whether Company 1 is 
better at plating silverware than Company 2. Assuming that there is 
essentially no difficulty in controlling the average thickness of the 
plating, we shall base our decision on the variability of its thickness. 
To set up the test so that rejection of the null hypothesis will enable us 
to conclude that the plating done by Company 1 is better (less variable) 
than that done by Company 2, we shall have to test the null hypothesis 
o} = of against the alternative hypothesis ¢? < o3, say, at the level of 
signficance & = 0.05. Supposing now that random samples of size 12 
of the work done by each company yields s, = 0.035 mil and s, = 

~ 0.062 mil for the thickness of the plating, we find that the appropriate 
test statistic has the value 


(0.062)? 


~ (0.0352 ae 





Comparing this value with 2.82, the value of Fy ,;(11,11) obtained 
from Table 6(a) by linear interpolation,* we conclude that the null 
hypothesis will have to be rejected; in other words, the data support 
the contention that the plating done by Company 1 is less variable 
than that done by Company 2. 


Example. To give an example of a two-tail test for the equality of two 
variances, let us return to the example on page 219, where we com- 
pared the average heat-producing capacity of the coal from two mines. 
In that example we went ahead and performed the two-sample f test 
even though there was a difference between the variances of the two 
samples. To justify this procedure, let us now test whether 07 = o2, and 
let us use the alternative hypothesis oj ~ 0%, since we are concerned 
only with the question whether or not these two variances are equal. 
Since we had n, = 5, n, = 6, s? = ese == 15,750, and s; = 24,000 


= 10,920 on page 219, we get 





_ 5? 15,750 _ 
oa e990 





Using « = 0.02, we find from Table 6(b) that Fy »,(4, 5) = 11.4, and 
it follows that the null hypothesis cannot be rejected. In other words, 
it was justifiable to use the two-sample f test in the given example. 


*Note that it was really unnecessary to interpolate in this case since 3.14 exceeds _ 
both Fo.05(10, 11) and Fo.o0s(12, 11). 
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Had we wanted to use the level of significance « = 0.05 or « = 0.01 in 
this example, we would have required tables of the values of Fo 925(V1; V2) 
or F'y.99s(V1, V2); such tables may be found in the Biometrika Tables for 
Statisticians listed in the Bibliography at the end of the book. Also, OC 
curves for the various F tests discussed in this section may be found in the 
book by A. H. Bowker and G. J. Lieberman listed in the Bibliography, 
while those for the one-tail tests only may be found in the National Bureau 


of Standards Handbook 91. 


EXERCISES 


1. Referring to Exercise 5 on page 222, test thato = 600 psi for the compressive 
strength of the given kind of steel. Use the alternative hypothesis @ > 600 
nd the level of significance & = 0.05. 


’ Tf 12 determinations of the specific heat of iron have a standard deviation of 
0.0086, test the null hypothesis that @ = 0.010 for such determinations. Use 
the alternative hypothesis o ~ 0.010 and the level of significance & = 0.01. 


3. Referring to Exercise 12 on page 223, test the null hypothesis that for the 
first kind of photocopying equipment the variability of the time that is 
r quired for repairs is given by o = 15.0 minutes. Use the alternative 
fhypothesis o > 15.0 and the level of significance & = 0.05. 


4! Test the null hypothesis that o = 0.015 inch for the diameters of certain 
bolts, if in a random sample of size 15 the diameters of the bolts had a vari- 
i. of 0.00011. Use the level of significance « = 0.01. 
5/R 


// Referring to Exercise 6 on page 232, test the null hypothesis that for such 
~ inereases of the pulse rate (while performing the given task) the variance can 
fe expected to equal o? = 30.0. Use the alternative hypothesis o2 < 30.0. 
6 
7 


Playing ten rounds of golf on his home course, a golf professional averaged 
1.3 with a standard deviation of 1.32. Test the null hypothesis that the con- 

sistency of his game on his home course is actually measured by 0 = 1.20, 

against the alternative hypothesis that he is less consistent. Use & = 0.05. 


7. Past data indicate that the variance of measurements made on sheet metal 
stampings by experienced quality control inspectors is 0.18 square inch. 
Such measurements made by an inexperienced inspector could have too 
large a variance (perhaps because of inability to read instruments properly) 
or too small a variance (perhaps because unusually high or low measurements 
are discarded). If a new inspector measures 100 stampings with a variance of 
0.13 square inch, test at the 0.05 level of significance whether the inspector is 
making satisfactory measurements. 
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Peon -he use of the two-sample ¢ test in Exercise 14 on page 223 by testing 
the null hypothesis that the two populations have equal variances. Use the 
level of significance & = 0.02. 


9. Justify the use of the two-sample ¢ test in Exercise 15 on page 224 by testing 
the null hypothesis that the two populations have equal variances. Use the 
vel of significance & = 0.02. 


10. Two different lighting techniques are compared by measuring the intensity 
of light at selected locations in areas lighted by the two methods. If 15 mea- 
surements in the first area had a standard deviation of 2.7 foot-candles and 
21 measurements in the second area had a standard deviation of 4.2 foot- 
candles, can it be concluded that the lighting in the second area is less uni- 
form?-Use a 0.01 level of significance. What assumptions must be made as 
to”~how the two samples are obtained ? 


1%. In Exercise 16 on page 224 we had n; = 40, n, = 30, sy = 15.2, and s, = 
18.7. Use the level of significance & = 0.05 to test the claim that at the second 
intersection there is a greater variability in the number of cars which make 
left turns approaching from the south between 4 p.m. and 6 P.M. 


12. Pull-strength tests on 10 soldered leads for a semiconductor device yield the 
following results in pounds force required to rupture the bond: 


15.8, 12,7, 13.2, 269, TOG. 18.8, 21,1, 143, 170, 123 


Another set of 8 leads was tested after encapsulation to determine whether 
the pull strength has been increased by encapsulation of the device, with the 
following results: 


24.9, 23.6, 19.8, 22.1, 20.4, 21.6, 21.8, 22.5 


How would you test the implied hypothesis ? Why ? 


13. Random samples of size n; and 2, respectively, are taken from two log- 
normal populations, and the resulting sample means are x, = 3.74 and 
X2, = 13.91. You wish to test whether the second population has a mean 
value four times as large as the first. 
(a) Can you directly use a two-sample test? Why? 
(b) Is there a transformation that can be made on the data that could con- 
ceivably allow the use of a two-sample test? 





Inferences 
Concerning 
Proportions 





9.1 ESTIMATION OF PROPORTIONS 


Many engineering problems deal with proportions, percentages, or 
probabilities. In acceptance sampling we are concerned with the proportion 
of defectives in a lot, and in life testing we are concerned with the percent- 
age of certain components which will perform satisfactorily during a stated 
period of time, or the probability that a given component will last at least 
a given number of hours. It should be clear from these examples that 
problems concerning proportions, percentages, or probabilities are really 
equivalent; a percentage is merely a proportion multiplied by 100, and 
a probability can be interpreted as a proportion “in the long run.” 

The information that is usually available for the estimation of a pro- 
portion is the number of times, x, that an appropriate event has occurred 
in n trials (or observations). The point estimate, itself, 1s usually the 


240 
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relative frequency x/n, namely, the proportion of the time that the event 
has actually occurred. If the x trials satisfy the assumptions underlying the 
binomial distribution listed on pages 54 and 55, we know that the mean and 
the standard deviation of the number of “successes” are given by mp and 
»/np(1 — p). Dividing both of these quantities by 1, we find that the mean 
and the standard deviation of the proportion of successes (namely, the 
relative frequency x/n) are given by 





=p and p22 





This shows that the sample proportion may be regarded as an unbiased 
estimator for the parameter p of a binomial distribution, namely, the 
“true” proportion we are trying to estimate on the basis of the sample. 

In the construction of confidence intervals for the parameter p of the 
binomial distribution, we meet several obstacles. First, there is the fact 
that the binomial distribution is discrete, so that it may be impossible to 
get an interval with a degree of confidence exactly equal to 1 — «; second, 
the standard deviation of the sampling distribution of x (or that of the 
sample proportion x/n) involves the parameter p we are trying to estimate. 
To construct a confidence interval for p having approximately a degree of 
confidence of 1 — «, we first select for a given set of values of p the corre- 
sponding quantities x) and x,, where x, is the Jargest integer for which 


3 b(k;n, p) <= 
k=0 


N| 


while x, is the smallest integer for which 


n 


3) bk; np) <% 


k=x) 


To emphasize the fact that x, and x, depend on the value chosen for p, 
we shall write these quantities as x,(p) and x,(p). We can then assert with 
a probability of approximately 1 — a (and at least 1 — a) that 


Xp) <x < x,(p) 


for any given value of p. To change these inequalities into confidence 
intervals for p, we can use a simple graphical method which is illustrated 
in the following example: 
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Example. Suppose we want to find approximate confidence intervals 
for p for samples of size n = 20. Using Table 1, we first deterimine xX, 
and x, for selected values of p such that x, is the /argest integer for 
which 


Be, ; 20, 2) = 0.025 
while x, is the smallest integer for which 
1 — B(x, — 1; 20, p) < 0.025 


Letting p equal 0.1, 0.2, 0.3,..., and 0.9, we thus obtain the values 
shown in the following table: 





Plotting the points (p, x.(p)) and (p, x;(p)) as we did in Figure 9.1, and 
drawing smooth curves, one through the x, points and one through 
the x, points, we can now “solve” for p. For any given value of x, 
we obtain approximate 0.95 confidence limits for p by going horizontal- 
ly to the two curves and marking off the corresponding values of p 
(see Figure 9.1). Thus, for x = 4 we obtain the approximate 0.95 
confidence interval 0.05 < p < 0.45. 


Graphs similar to the one shown in Figure 9.1 are given in Tables 9(a) 
and 9(b) at the end of the book for various values of m and the degrees 
of confidence 0.95 and 0.99. These tables differ from the one of Figure 
9.1 in that the relative frequency x/n is used instead of x, thus making it 
possible to graph curves corresponding to various values of m on the 
same diagram. Also, for increased accuracy, Tables 9(a) and 9(b) are 
arranged so that values of x/n from 0 to 0.50 are marked on the bottom 
scale while those from 0.50 to 1.00 are marked on the top scale of the 
diagram. For values of x/n from 0 to 0.50 the confidence limits for p are 
read off the left-hand scale of the diagram, while for values of x/n from 
0.50 to 1.00 they are read off the right-hand scale. 


Continuation of Example. Note that for m = 20 and x = 4 we obtain 
the 0.95 confidence interval 0.06 < p < 0.44 with the use of Table 
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Figure 9.1. Confidence intervals for proportions (7 = 20). 


9(a). This is very close, indeed, to the confidence interval obtained 
earlier with the use of the somewhat cruder Figure 9.1. 


When n is large and there is no reason to suspect that p is very close to 
either 0 or 1, we can construct approximate confidence intervals for Pp 
making use of the normal curve approximation to the binomial distribu- 
tion. Since the mean and the standard deviation of the binomial distribu- 
tion are np and ,/np(1 — p), we can assert with a probability of 1 — a 
that x, the number of successes we obtain in 7 trials, will satisfy the 
inequality 


x — np 


Jno — py ~ * 


Solving this quadratic inequality for p, we can obtain a corresponding set 
of (approximate) confidence limits for p (see Exercise 10 on page 250). 
However, since the necessary calculations are involved, we shall use the 
further approximation of substituting the sample proportion x/n for p in 


»/np(l — p). This yields the following 1 —« large-sample confidence 
interval for p: 


— 24/2 sd 
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Example. For instance, for n = 100 and x = 36 we obtain the 0.95 
large-sample confidence interval 


_ (0.36)(0.64) qo 
0.36 1.96 4f O30)0.68) < p <= 0,36 + 1.96 ae (ct 


OF 
0.266 < p < 0.454 


Had we used Table 9(a) for this example, we would have obtained 
0.27 < p< 0.46. 
The magnitude of the error we make when using x/n as an estimate of 


p is given by , and using the normal curve approximation to the 








x 
“ Pp 


binomial distribution we can assert with a probability of 1 — o that 


|x — np| , |= | (a=® 
am ona a | cn 1 ee ae A aD, 
»/np(l — p) a ee fe n 


To be able to use this last formula, we again substitute x/n for p inside 
the radical and, so long as we are dealing with a large sample, we can 
assert with a probability of | — « that the error in using x/n as an estimate 
of p is less than 





5 
E = Za /2 ci ee 8 


n 





Example. For instance, if 136 of 400 persons interviewed as part 
of a poll felt their city’s public transportation system is adequate 
and we estimate the corresponding true proportion as $28 = 0.34, we 
can assert with a probability of 0.95 that the error of this estimate 
does not exceed 


7 (0.34) (0.66) _ 
E = 1.96,/—= =? = 0.046 
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The formula on which the above estimate of the error was based can 
also be used to determine the sample size that is required to attain a desired 


degree of precision, or reliability. Solving for n, we get 


n= p(l — p)| 22] 


but this formula cannot be used as it stands unless we have some informa- 
tion about the possible size of p (on the basis of collateral data, say, a pilot 
sample). If no such information is available, we can make use of the 
fact that p(1 — p) is at most equal to 4, as can be shown by the method of 
elementary calculus. Thus, if 


we can assert with a probability of at /Jeast 1 — ow that the error in using 
x/n as an estimate of p is less than E. 


Example. If we want to use a sample proportion to estimate the true 
proportion of defectives in a very large shipment of adobe bricks, and 
we want to be able to assert with a degree of confidence of at least 
0.95 that the size of the error is less than 0.04, we would have to take 
a sample of size 


1 /1.96\2 
n= 4(59a) — 600,25 


that is, a sample of size 601 (rounded up). Had we known from experi- 
ence with similar adobe brick that the proportion we are trying to 
estimate is in the neighborhood of 0.12, substitution into the appro- 
priate formula would have yielded 


aT ae p)| 732] = (0.12)(0.88)| a2 | aHO5RS 


that is, a sample of size 254. This serves to illustrate how some collateral 
information about the possible size of p can substantially reduce the 
size of the required sample. 
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If p is very close to 0 or 1, none of the confidence intervals we have 
described provides a satisfactory approximation, even when 7 is large. 
Actually, in cases where p is close to 0, as might be encountered in testing 
for high reliability, we are really more interested in one-sided confidence 
intervals of the form p < C; that is, we are interested mainly in finding 
an upper confidence limit for p. (When p is close to 1, we would, cor- 
respondingly, be interested in an upper confidence limit for 1— p and, 
hence, a Jower confidence limit for p.) As we already pointed out on page 
79, the binomial distribution is best approximated with a Poisson dis- 
tribution (rather than a normal distribution) when p is small and n is 
large. Thus, using the relation 2 = np, we can obtain an upper con- 
fidence limit for p based essentially on the Poisson distribution. Without 
going into any detail, let us merely state the result that this leads to the 
approximate 1 — « confidence interval 





J. y? 


P= 5, Xa 





where v2 is as defined on page 176 and the number of degrees of freedom 
equals 2(x + 1). A discussion of this approximation may be found in the 
book by A. Hald mentioned in the Bibliography at the end of the book. 


Example. For x = 4, n= 2,000, and « = 0.01, for instance, we 
obtain the 0.99 confidence interval 


| 2 
P< 375-000)" 23.209 = 0.0058 


where 23.209 is the value of v2 5, for 2(4 + 1) = 10 degrees of free- 
dom. Note that if we had erroneously used the large-sample con- 
fidence interval formula on page 244, we would have obtained 0.0020 
+ 0.0026, namely, the much narrower interval p < 0.0046. 


9.2 BAYESIAN ESTIMATION 


In the preceding section we looked upon the “true” proportions we 
tried to estimate as unknown constants; in Bayesian estimation these 
parameters are looked upon as random variables having prior distributions 
reflecting either the strength of one’s belief about the possible values they 
can take on, or other indirect information. As in Section 7.3, we are thus 
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faced with the problem of combining prior information with direct sample 
evidence. 


Example. To illustrate how this might be done, suppose. that a manu- 
facturer, who regularly receives large shipments of electronic com- 
ponents from a vendor, knows that about 25 percent of the time 0.005 
(half of one percent) of the components are defective, about 25 percent 
of the time 0.01 of the components are defective, and about 50 percent 
of the time 0.02 of the components are defective. Thus, before a ship- 
ment from this vendor is inspected, we have the following prior dis- 
tribution for the proportion of defectives: 


Value Prior 
of p probability 
0.005 0.25 
0.01 0.25 
0.02 0.50 


Now suppose that 200 of these components, randomly selected 
from the shipment, are inspected, and only one of them is found to 
be defective. The probability of this happening when p = 0.005, 


p = 0.01, orp = 0.02 are, respectively, (*)°)(0.005)' (0.995)!9 = 0,37, 


(*)°)(0.01)"(0.99)"9 = 0.27, and (“") (0.02)'(0.98)'?? = 0.07, where 


we used the formula for the binomial distribution on page 56 and 
logarithms to simplify the calculations. Combining these probabilities 
with the prior probabilities by means of the formula for the rule of 
Bayes (Theorem 2.8 on page 39), we find that the posterior probability 
for p = 0.005 is 


(0.25)(0.37) _047 
(0.25)(0.37) + (0.25)(0.27) + (0.50)(0.07) 


and that the corresponding posterior probabilities for p = 0.01 and 
p = 0.02 are, respectively, 0.35 and 0.18. We have thus arrived at the 
following posterior distribution for the proportion of defective com- 
ponents: 


Value Posterior 
of p probability 
0.005 0.47 
0.01 0.35 
0.02 0.18 
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Note that whereas the odds were originally 3 to 1 against p =: 0.005, it 
is now almost an even bet; of course, this shift is accounted for by 


the fact that in the sample only 54, = 0.005 of the components 
inspected were defective. 





It was assumed in this example that p had to be 0.005, 0.01, or 0.02, 
and this restriction was imposed mainly to simplify the calculations; the 
method would have been the same if we had considered 10 different 
values of p, or even 100. It would be more logical, perhaps, to let p take 
on any value on the continuous interval from 0 to 1, and in that case 
it is customary to use as the prior distribution the beta distribution of 
Section 4.8. The paranieters of this distribution are a and f, and its 
mean, /,, and standard deviation, a), can be expressed in terms of w and 
B by means of the formulas on page 120. Then, proceeding as on page 
190, it can be shown that the posterior distribution of p, namely, the 
conditional distribution of p for a given (observed) value of x, is also a beta 
distribution, and that its parameters are x + « instead of andn —x + B 
instead of f.* Thus, it follows from the formula on page 120, that the mean 
of the posterior distribution of p is given by 


aa 


ae I, 
rr a ae 





and we have thus arrived at a formula for a Bayesian point estimate of the 
“true” proportion with which we may be concerned in any given example. 


Example. To illustrate these ideas, suppose that a person doing 
research for a large oil company “feels” that the proportion of cus- 
tomers requiring oil as well as gasoline is a random variable having 
as its prior distribution the beta distribution with « = 60 and f = 
300. (This “feeling” might have been based on an analysis of prior 
experience.) To picture this prior distribution, note that according 
to the formulas on page 120, 


60 


Lo — 60 + 300 = 0.167 


*Proofs of these results may be found in the book by J. E. Freund listed in the Bib- 
liography at the end of the book. 
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and 


60-300 


= (60 F 300)(60 + 300 1) ~ 2-00038° 


or 

Thus, the mean of this distribution is 0.167 and its standard deviation 
is about 0.020. 

Now suppose that the person doing the research gets some new 

data, showing that in a random sample of size n = 800, there were 

only x = 94 who needed oil as well as gasoline. Combining these data 


with the prior information, we find that the mean of the posterior 
distribution, the point estimate of the “true” proportion, is 


94 + 60 


41 = $0 4- 300 800 Ot? 


and in Exercise 17 on page 252 the reader will be asked to verify that 
the standard deviation of the posterior distribution is o, = 0.010. 


Given an appropriate table, it is possible to continue with an example like 
this and calculate the posterior probabilities that are associated with 
various intervals. For instance, in our example we would be able to 
determine the posterior probability that the true proportion of cus- 
tomers needing oil as well as gasoline is between 0.13 and 0.14, or, say, 
between 0.12 and 0.15. | 


EXERCISES 


ae sample of 200 claims filed against an insurance company writing 
collision insurance on cars, 84 exceeded $1,200. Use Table 9 to construct a 
0.95 confidence interval for the true proportion of claims filed against this 
company that exceed $1,200. 


2. Ina sample of 100 reports of UFO sightings, 83 could easily be explained in 
terms of natural phenomena. Use Table 9 to construct a 0.99 confidence 
interyal for the corresponding true proportion of reported UFO sightings 
that can easily be explained in terms of natural phenomena. 


. Ina random sample of 300 industrial accidents, it was found that 173 were 
due at least partially to unsafe working conditions. Construct a 0.95 confi- 
dence interval for the corresponding true proportion 
(a) using Table 9; 

(b) using the large-sample confidence interval formula on page 244. 
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4. Inarandom sample of 160 workers exposed to a certain amount of radiation, 
24 experience some ill effects. Construct a 0.99 confidence interval for the 
corresponding true percentage 

(a¥ using Table 9; 

) using the large-sample confidence interval formula on page 244. 






&% In a sample survey, 200 persons with incomes of $12,000 or more were 
asked “Where would you most likely find out all there is to know about 
some news in which you are very much interested?” If 112 replied “televi- 
sion,” construct a 0.95 confidence interval for the corresponding true pro- 
portion 
(a) using Table 9; 

(b) using the large-sample confidence interval formula on page 244. 


6. In a sample survey of the “safety explosives” used in certain mining opera- 
tions, explosives containing potassium nitrate were found to be used in 95 
of 250 cases. 

(a) Use Table 9 to construct a 0.95 confidence interval for the correspond- 
ing true proportion. 
(b) If 25 = 0.38 is used as an estimate of the corresponding true propor- 


250 
| tion, what can we say with a probability of 0.95 about the possible size 


{ of our error? 

4, In a recent study, 69 of 120 meteorites were observed to enter the earth’s 
atmosphere with a velocity less than 26 miles per second. What can we say 
with a probability of 0.99 about the possible size of our error, if we estimate 

Le corresponding true proportion as {a = = 0.35/57 


What is the size of the smallest sample required to estimate an unknown 
proportion to within a maximum error of 0.05 with a degree of confidence of 
at least 0.95. How would the required sample size be affected if it were known 
that the proportion to be estimated is in the neighborhood of 0.25? 


€ 


9, What is the minimum sample size a public opinion poll has to use if it wants 
to be able to assert with a probability of at least 0.95 that its estimate of the 
percentage of the vote a certain candidate will get is not “off” by more than 
3 percent? How would the required minimum sample size be affected if it 
were known that the percentage to be estimated is in the neighborhood of 
60 percent? 






10. Show that the double inequality on page 243 leads to the following 1 — & 
confidence limits: 
xe 2p oe Z/n[ = 2a ae 52 


n-+ zi/2 


11. Use the confidence interval formula of Exercise 10 to rework Exercise 3 and 
compare the results. 


12. In 4,000 firings of a certain kind of rocket there were ten instances in which 
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a rocket exploded upon ignition. Construct an upper 0.95 confidence limit 
for the probability that such a rocket will explode on ignition. 


Observing the amount of pollutants in the air on 500 days, it was found that 
the amount of pollutants in the air in a western city exceeded 200 micrograms 
per cubic meter only four times. Construct an upper 0.99 confidence limit 
for the probability that the air pollution in this city will exceed 200 micro- 
grams per cubic meter on any one day. 


. The head of a highway department feels that 4 out of 5 road building jobs 


15. 


16, 


will stay within cost estimates, while his assistant feels that it should be only 

3 out of 5. 

(a) If the head of the highway department is regarded to be “three times as 
good” as his assistant in determining figures like these, what prior prob- 
abilities should we assign to their respective claims? 

(b) What posterior probabilities would we assign to their claims if it was 
found that among 12 road building jobs (randomly selected from the 
department’s files) only two stayed within cost estimates ? 

(c) Calculate the mean of the posterior distribution as a Bayesian point 
estimate of the probability that one of the department’s road building 
jobs will stay within cost estimates, and compare it with corresponding 
point estimates based, respectively, on the prior information alone and 
on the direct sample evidence alone. 


e aver agent of a firm feels that the probability is 0.80 that any one 
of several shipments of steel recently received will meet specifications. The 
head of the firm’s quality control department feels that this probability is 
0.90, and the chief engineer feels (Somewhat more pessimistically) that it is 
0.60. 

(a) If the managing director of the firm feels that in this matter the purchas- 
ing agent is 10 times as reliable as the chief engineer while the head of 
the quality control department is 14 times as reliable as the chief engi- 
neer, what prior probabilities would he assign to the respective claims ? 

(b) If five of the shipments are inspected and only two meet specifications, 
how would this affect the prior probabilities; that is, what posterior 
probabilities should the managing director of the firm assign to the 
respective claims. 


e output of a certain transistor production line is checked daily by inspect- 
ing a sample of 200 units. Over a long period of time, the process has main- 
tained a yield of 80 percent, that is, a proportion defective of 0.20, and the 
variation of the proportion defective (from day to day) is measured by a 
standard deviation of 0.0125. 

(a) Verify that this prior distribution of the proportion defective can be 
approximated with a beta distribution having ® = 200 and B = 800. 

(b) If on a given day the sample (of 200 units) contains 86 defectives, esti- 
mate that day’s proportion defective in terms of the mean of the corre- 
sponding posterior distribution. 
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17. Verify for the example on page 248 that the standard deviation of the poste- 
rior distribution is 0, = 0.010. 


18. Records of the dean of an engineering school (collected over many years) 
show that on the average 75 percent of all applicants have an I.Q. of at least 
115. Of course, the percentage varies somewhat from year to year and this 
variation is measured by a standard deviation of 2.15 percent. 

(a) Verify that the prior distribution of the proportion of applicants with an 

1.Q. of at least 115 can be approximated with a beta distribution having 
o& = 300 and B = 100. 

(b) If a sample check of 25 applicants in the year 1976 showed that only 12 
of them have an I.Q. of at least 115, estimate that year’s proportion of 
fapplicants with an I.Q. of at least 115 in terms of the mean of the corre- 
sponding posterior distribution. 






3 HYPOTHESES CONCERNING ONE 
PROPORTION 


Many of the methods used in sampling inspection, quality control, 
and reliability verification are based on tests of the null hypothesis that 
a proportion (percentage, or probability) equals some specified constant. 
The details of the application of such tests to quality control will be 
discussed in Chapter 14, where we shall also go into some problems of 
sampling inspection; applications to reliability and life testing will be taken 
up in Chapter 15. 

Although there are exact tests based on the binomial distribution 
(which can be constructed with the use of Table 1), we shall consider here 
only approximate large-sample tests (n > 100) based on the normal 
approximation to the binomial distribution. In other words, we shall 
test the null hypothesis p = p, against one of the alternatives p < py, 
P > Po, OF Pp ¥ Py With the use of the following statistic* 


a XS MP 
/np (1 — po) 


If the null hypothesis is true, the sampling distribution of this statistic is 
approximately the standard normal distribution, and the critical regions 
for the test are as shown in the following table: 


*Some authors write the numerator of this formula for z as x -- 4 — mpo, whichever 
is numerically smaller, but there is generally no need for this continuity correction so 
long as nis large. 
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CRITICAL REGIONS FOR TESTING Ho: p = Po 
(Large Samples) 





Alternative 

hypothesis Reject Ho if 
D<Po a Sy, 
P > Po Z> Zo, 
P# Po Z< —2Zy/2 


Or Z > Ze/2 


(If the sample size is not large enough to use the normal approximation of 
the binomial distribution, we have to use the exact test mentioned above. 
For instance, to test the null hypothesis p = py against the alternative 
Pp < Do, we use the critical region x < k,, where k, is the /argest integer 
such that B(k,3;7, Po) < &.) 


Example. To illustrate this kind of test, suppose we want to investigate 
whether certain detonators used with explosives in coal mining meet 
the requirement that at least 90 percent of the detonators will ignite 
the explosive when charged. Thus, we shall want to test the null 
hypothesis p = 0.90 against the alternative hypothesis p < 0.90, 
say, at the level of significance « = 0.05. If 174 of 200 detonators 
function properly, we get 


174 — 200(0.90) _ 


~ ./200(0.90)(0.10) ail 


a 


and since this is not less than —1.645, the value of —Zy.o5, we find 
that the null hypothesis cannot be rejected. In other words, there is 
not sufficient evidence to say that in general the detonators will fail 
to meet the required standard. 


- a HYPOTHESES CONCERNING SEVERAL 
PROPORTIONS | 


When we compare the consumer response (percentage favorable and 
percentage unfavorable) to two different products, when we decide whether 
the proportion of defectives of a given process remains constant from day 
to day, when we judge whether there is a difference in political persuasion 
among several nationality groups, and in many similar situations, we are 
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interested in testing whether two or more binomial populations have the 
same parameter p. Referring to these parameters as p,,p,,..., and 
Px, We are, in fact, interested in testing the null hypothesis 


Pi=Pr_=...-=DPy 


against the alternative that at least two of these population proportions 
are unequal. To perform a suitable large-sample test of this hypothesis, 
we require independent random samples of sizen,,n,,..., and n, from 
the A populations; then, if the corresponding numbers of “successes” are 
X1,X2,..., and x,, the test we shall use is based on the fact that (1) for 
large samples the sampling distribution of 


X; — Nn;p; 


Z; = CV C—O. 
Jn; pi —= py) 


Is approximately the standard normal distribution, (2) the square of a 
random variable having the standard normal distribution is a random 
variable having the chi-square distribution with 1 degree of freedom, and 
(3) the sum of k independent random variables having chi-square distribu- 
tions with 1 degree of freedom is a random variable having the chi-square 
distribution with k degrees of freedom. (Proofs of these last two results 
may be found in the book by J. E. Freund mentioned in the Bibliography.) 
Thus, the sampling distribution of the statistic 


i (x; = 1:P;)? 
x 2 nipd1 — p;) 


is approximately the chi-square distribution with k degrees of freedom. 
(This approximation is usually quite close so long as n,p; > 5 for all i.) 
Now, if the null hypothesis is true, and p, = p, =... = p, = p, the value 
of the above y? statistic becomes 
k 2 
an (x; — mp) 
‘ ~ n,p(l — p) 
and in actual practice we substitute for p (which, of course, is unknown) 
the pooled estimate 


Ny Nee ee ee ey 


ads Se ane poi ee 


Since the null hypothesis should be rejected if the differences between the 
x, and the n; p are large, the critical region is y? > y2, where y? is as defined 


255 9.4 Hypotheses Concerning Several Proportions 


on page 176 and the number of degrees of freedom is k — 1. The loss of 
one degree of freedom results from the fact that p is estimated by p. 

We shall not illustrate this test right away, because in practice it is 
convenient to perform the test or express the 7? statistic in different 
(though equivalent) ways. In the special case where k = 2, in which 
we are testing the null hypothesis that two population proportions are 
equal, we base our decision on the statistic 





whose sampling distribution is approximately the standard normal 
distribution. The test based on this statistic is equivalent to the y* test 
described above, in the sense that the square of this z statistic actually 
equals the above y? statistic with k = 2 (see Exercise 8 on page 259). How- 
ever, it is preferable to use the z statistic, as it enables us to test the null 
hypothesis p, = p, against one-sided as well as two-sided alternatives; 
this is not the case for the y? statistic, with which we can test only against 
the two-sided alternative p, ~ p,. The resulting critical regions based on 
the above z statistic are 


CRITICAL REGIONS FOR TESTING Ho: pi = P2 
(Large Samples) 


Alternative 
hypothesis Reject Ho if 


pi = P2 Em Ze 


2 2h 


Zz < —Za/2 
Or Z = Za,/2 





Example. To illustrate this kind of test, suppose that 16 of 200 tractors 
produced on one assembly line required extensive adjustments before 
they could be shipped, while the same was true for 14 of 400 tractors 
produced on another assembly line. To decide whether the difference 
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between the corresponding proportions, namely, a& = 0.08 and 
Zo45 = 0.035, can be attributed to chance, let us test the null hypothesis 
P: = P2 against the alternative hypothesis p, ~ p, at the level of 
significance «& = 0.05. Substituting x, = 16, n, = 200, x, = 14, and 


n, = 400 into the formulas for f and z, we obtain 


- 16+ 14 


= 300 + 400 ~ 29° 
and 
6 14. 
200 400 


00 
— FOUN Pako > 238 


Since this exceeds z) 9.5; = 1.96, we find that the null hypothesis will 
have to be rejected. In other words, we conclude that the true pro- 
portion of tractors requiring extensive adjustments is not the same 
for the two assembly lines. 


The test we have described here applies to the null hypothesis p, = p,, but 
it can easily be modified (see Exercise 14 on page 260) so that it applies 
also to the null hypothesis p, — p, = 6, where 6 is not necessarily zero. 

When we apply the chi-square criterion on page 254 to the comparison 
of more than two sample proportions, it is convenient to look at the data 
as arranged in the following fashion: 





Sample 1 Sample 2 pi Sample k Total 
Successes x4 x2 was ba x 
Failures Ny — X41 nz — x2 fads Nk — Xx n —x 
Total ny n2 mee Nk n 


The notation is the same as before, except for x and n, which represent, 
respectively, the total number of successes and the total number of trials 
for all samples combined. With reference to this table, the entry in the 
cell belonging to the ith row and jth column is called the observed cell 
frequency f,,, with i= 1,2 andj=1,2,...,k. 

Under the null hypothesis that p, = p, =... = p, = p, we estimate 
p, as before, as the total number of successes divided by the total number 
of trials, which we now write as p = x/n. Hence, the expected number of 
successes and failures for the jth sample are estimated, respectively, as 
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Ny X 
n 


and eo; = nl fas Dp) = ata x) 





C1; — n;* p ——= 
| 


The quantities e,, and e,, are referred to as the expected cell frequencies 
e,,, withi = 1,2andj=—1,2,...,k. Thus, the expected frequency for any 
given cell may be obtained by multiplying the totals of the row and column to 
which the cell belongs, and then dividing by the grand total n. 

Using this new notation, it can be shown (see Exercise 16 on page 260) 
that the y? statistic on page 254 can also be written in the form 


2 k 6.) = .,)2 
xy? = ps pa (fii €1;) 


This formula has the advantage that it can easily be extended to the more 
general case, to be treated in Section 9.5, where each trial permits more 
than two possible outcomes, and there are, thus, more than two rows in 
the tabular presentation of the various frequencies. 


Example. To illustrate the use of this y* statistic, suppose that samples 
of three kinds of materials, subjected to extreme temperature changes, 
produced the results shown in the following table: 


Material A Material B Material C Total 


Crumbled 41 27 22 90 


Remainea 79 53 78 210 
intact 
Total 120 80 100 300 


The expected frequencies for the first two cells of the first row are 


90-120 


90-80 
€11 = 399 == 24 


= 36 and C45 = “300 = 

and, as it can be shown that the sum of the expected frequencies for 
each row or column equals that of the corresponding observed frequencies 
(see Exercise 20 on page 261), we find by subtraction that e,, equals 
90 — (36 + 24) = 30, and that the expected frequencies for the second 
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row equal e,, = 120 — 36 = 84, e,, = 80 — 24 = 56, and e,, = 100 
— 30 = 70. Substituting these values together with the observed cell 
frequencies into the formula for the y? statistic with k = 3, we obtain 


, __ (41 — 36)? | (27 — 24)? , (22 — 30 
ae iar: eed aT aaa ia, 


(79 — 84)? (53 — 56)? , (78 — 70) 
a ee a 


= 4.575 


Since this is less than 5.991, the value of y2,; fork —1=2 degrees 
of freedom, we find that the null hypothesis cannot be rejected. In other 
words, the data do not refute the claim that the probability of crum- 
bling when subjected to extreme temperature changes is the same for 
the three kinds of material. 


It is customary in problems of this kind to round the expected cell fre- 
quencies to the nearest integer or to one decimal. Most of the entries of 
Table 5 are given to three decimals, but there is seldom any need to carry 
more than two decimals when calculating the value of the y? statistic. 


EXERCISES 


1. A manufacturer of submersible pumps claims that at most 30 percent of his 
pumps require repairs within the first five years they are in operation. What 
can we conclude about this claim at the level of significance « = 0.05, if 
among 120 of these pumps there are 47 which require repairs within the first 

ve years they are in operation? 


“4. The performance of a computer is observed over a period of two years to 
check the claim that the probability is 0.20 that its down time will exceed 
five hours in any given week. Testing the null hypothesis p = 0.20 against 
the alternative hypothesis p ~ 0.20, what can we conclude at the level of 
significance & = 0.05, if there were only 11 weeks in which the down time of 
the computer exceeded five hours? 


3. To check on an ambulance service’s claim that at least 40 percent of its calls 
are life-threatening emergencies, a random sample was taken from its files, 
and it was found that only 49 of 150 calls were life-threatening emergencies. 
Test the null hypothesis p = 0.40 against a suitable alternative at the level 
of significance «0 = 0.01. 
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Pn that a nutritionist claims that at most 75 percent of the pre-school 
children in a certain country have protein deficient diets. What can we con- 
clude about this claim at the level of significance & = 0.01, if 246 of 300 
pre-school children (included in a sample survey conducted in that country) 
are found to have protein deficient diets? 


5. Suppose that 4 of 13 undergraduate engineering students state that they will 
go on to graduate school. Test the dean’s claim that 60 percent of the under- 
graduate students will go on to graduate school, using the alternative 
hypothesis p < 0.60 and the level of significance & = 0.05. [Hint: Use Table 
1 to determine the probability of getting “at most 4 successes in 13 trials” 
when p = 0.60.] 


6. Suppose we want to test the “honesty” of a coin on the basis of the number 
of heads we will get in 15 flips. Using Table 1, determine how few or how 
many heads we would have to get so that we could reject the null hypothesis 

— 0.50 against the alternative hypothesis p 4 0.50 at the level of signifi- 
cance & = 0.05. What is the actual level of significance we would be using 
with this criterion ? 


“It costs more to test a certain type of ammunition than to manufacture it, 
and, hence, only three rounds are tested from each large lot. If the lot is 
rejected unless all three rounds function according to specifications, 

(a) sketch the OC curve for this test; 
(b) find the actual proportion of defectives for which the test procedure will 
cause a lot to be rejected with a probability of 0.10. 


8. Show that the square of the z statistic on page 255 equals the chi-square 


statistic on page 254 for k = 2. 

Pt manufacturer of electronic equipment subjects samples of two competing 
brands of transistors to an accelerated performance test. If 45 of 180 transis- 
tors of the first kind and 34 of 120 transistors of the second kind fail the test, 
what can he conclude at the level of significance &# = 0.05 about the dif- 
ference between the corresponding sample proportions? 


10. If one method of producing rain by “seeding” clouds was successful in 16 of 
50 attempts, while another method was successful in 29 of 80 attempts, can 
we conclude at the level of significance & = 0.05 that the second method is 

‘tter than the first? 


M1. Two groups of 80 patients each took part in an experiment in which one 
group received pills containing an anti-allergy drug, while the other group 
received a placebo (a pill containing no drug). If in the group given the drug 
23 exhibited allergic symptoms while in the group given the placebo 41 
exhibited such symptoms, is this sufficient evidence to conclude at the level 
of significance & = 0.01 that the drug is effective in reducing these symp- 
toms? 


12. A study showed that 84 of 200 persons who saw a deodorant advertised 
during the telecast of a football game and 96 of 200 persons who saw it adver- 
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13. 


14. 


15. 


16. 


17. 


18. 


tised on a variety show remembered two hours later the name of the deodor- 
ant. Use the level of significance & = 0.05 to test the null hypothesis that 
there is no difference between the corresponding “true” proportions. 


In a sample of the visitors to a cavern (a tourist attraction which had once 
served as a hide-out to a notorious outlaw), 86 of 250 men and 152 of 250 
women bought souvenirs. Test whether the difference between the corre- 
sponding sample proportions is significant, using the level of significance 
a = 0.01 and 

(a) the chi-square statistic given on page 254; 

(b) the z statistic given on page 255. 

Also verify that the square of the value obtained for the z statistic equals that 
obtained for the chi-square statistic. 


If we want to test the null hypothesis that the difference between two popula- 
tion proportions equals some constant 6, not necessarily 0, we can base our 
decision on the statistic 


PhS 
LS a, ee ee 
“(4 2 1) 2 (4 = x2) 
ny ny N2 No 
Na Sieh Scare eee AED a. a ee eran Ae 
ny Nz 


whose sampling distribution is approximately the standard normal distribu- 
tion so long as n; and n, are both large. Referring to Exercise 11, use this 
theory and the level of significance & = 0.05 to test whether the percentage 
of patients exhibiting allergic symptoms is at least 8 percent less for those 
who actually receive the drug. 


In a true-false test, a test item is considered to be good if it discriminates 
between well-prepared students and poorly-prepared students. If 205 of 250 
well-prepared students and 137 of 250 poorly-prepared students answer a 
certain item correctly, use the theory of Exercise 14 and the level of signifi- 
cance &® = 0.05 to test whether for the given item the proportion of correct 
answers will in general be at least 20 percent higher among well-prepared 
students than among poorly-prepared students. 


Verify that the two formulas given for the Y2 statistic on pages 254 and 257 
are equivalent. 


If 26 of 200 tires of Brand A failed to last 20,000 miles, while the corre- 
sponding figures for 200 tires each of Brands B, C, and D were 23, 15, and 
32, use the level of significance & = 0.05 to test the null hypothesis that there 
is no difference in the quality of the four kinds of tires. 


Tests are made on the proportion of defective castings produced by five 
different molds. If there were 14 defectives among 100 castings made with 
mold I, 33 defectives among 200 castings made with mold II, 21 defectives 
among 180 castings made with mold ITI, 17 defectives among 120 castings 
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made with mold IV, and 25 defectives among 150 castings made with mold 
V, test (at the 0.05 level of significance) whether the true proportion of defec- 
tives is the same for each mold. 

19. Using a level of significance of & = 0.05, can it be concluded from the fol- 


lowing sample data that the proportion of employees favoring a new pension 
plan is not the same for three different government agencies: 





Agency I Agency 2 Agency 3 
shu 67 84 109 
pension plan 
Against the 33 66 Al 
pension plan 





20. Verify that if the expected frequencies are determined as indicated on page 
257, the sum of the expected frequencies for each row or column equals the 
sum of the corresponding observed frequencies. 


9.5 THE ANALYSIS OF r-by-k TABLES 


As we suggested on page 257, the method by which we analyzed the 
last example of the preceding section lends itself also to the analysis of 
so-called r-by-k tables, that is, tables in which observed frequencies are 
arranged in r rows and k columns. Such tables arise in essentially two 
kinds of problems. First, we might again have samples from & populations, 
with the distinction that now each trial permits more than two possible 
outcomes. This might happen, for example, if persons belonging to dif- 
ferent income groups are asked whether they favor a certain political 
candidate, whether they are against him, or whether they are indifferent 
or undecided. The other situation giving rise to an r-by-k table is one in 
which we sample from one population but classify each item with respect 
to two (usually qualitative) categories. This might happen, for example, 
if a consumer testing service rates cars as excellent, superior, average, or 
poor with regard to performance and also with regard to appearance. 
Each car tested would then fall into one of the 16 cells of a 4-by-4 table, 
and it is mainly in connection with problems of this kind that r-by-k 
tables are referred to as contingency tables. 

The essential difference between the two situations giving rise to 
r-by-k tables is that in the first case the column totals (the sample sizes) 
are fixed, while in the second case only the grand total (the total for the 
entire table) is fixed. As a result, there are also differences in the null 
hypotheses we shall want to test. In the first case we want to test whether 
the probability of obtaining an observation in the ith row ts the same for 
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each column; symbolically, we shall want to test the null hypothesis 
Pu Pa Ht es — Py (OP Pa heat 


where p,,; is the probability of obtaining an observation belonging to the 
ith row and the jth column. The alternative hypothesis is that the p’s 
are not all alike at least for one row. In the second case we shall want to 
test whether the random variables represented by the two classifications are 
independent; symbolically, we shall want to test the null hypothesis 


a re 


p= (PD NPs for 
Pi; = (Di. D.;) eee 


where p,, is the probability of obtaining an observation belonging to the 
ith row and p,, is the probability of obtaining an item belonging to the 
jth column. The alternative to this null hypothesis is that the equality does 
not hold for at least one pair of values of i and /. 

In spite of the differences we have described, the analysis of an r-by-k 
table is the same for both cases. First we calculate the expected cell fre- 
quencies e,, as on page 257, namely, by multiplying the totals of the 
respective rows and columns and then dividing by the grand total. In 
practice, we make use of the fact that the observed frequencies and the 
expected frequencies total the same for each row and column, so that 
only (r — 1)(k — 1) of the e,, have to be calculated directly, while the 
others can be obtained by subtraction from appropriate row or column 
totals. We then substitute into the formula 


= 3 > Sip Or)” 


Ci; 





and we reject the null hypothesis if the value of this statistic exceeds y? 
with (r — 1)(k — 1) degrees of freedom. [This expression for the number 
of degrees of freedom is justified by the above observation that after we 
choose (r — 1)(kK — 1) of the expected cell frequencies, the others are 
automatically determined, that is, they may be obtained by subtraction 
from appropriate row or column totals.] 


Example. To illustrate the general approach, suppose that the per- 
sonnel manager of a large company wants to know whether there 
really is a relationship between an employee’s performance in the com- 
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pany’s training program and his (or her) ultimate success in the job. 
Suppose, furthermore, that a sample of 400 cases (taken from the 
company’s very extensive files) yielded the results shown in the fol- 
lowing table: 


Performance in training program 


Below Above 
average Average average Total 
Poor 23 60 29 112 
Success in job 
(Employer’s rating) Average 28 19 60 167 
Very good 9 49 63 121 
Total 60 188 152 400 


Calculating first the expected cell frequencies for the first two cells of 
the first two rows, we get 





— 112-60 _ — 112-188 _ 
_ 167-60 _ 167-188 
ea = a ZU, (= ay = 78.5 
and then, by subtraction, we find that the other expected cell fre- 
quencies are e,; = 42.6, e,; = 63.5, €3; = 18.2, e3. = 56.9, and e;; = 
45.9. Thus, 
» _ (23 ~ 16.8)? | (60 — 52.6)? , (29 — 42.6)? 
i 16.8 52.6 42.6 
4 28 — 25.0)? 4 (79 — 78.5)" (60 — 63.5)? 
25.0 18:93 63.5 
_@ — 18.2)? 4 (49 — 56.9)? de (63 — 45.9)? 
18.2 56.9 45.9 
= 20.34 


and since this exceeds 9.488 as well as 13.277, the values of 76.05 
and y2.,, for (3 — 1)(3 — 1) = 4 degrees of freedom, we find that the 
null hypothesis will have to be rejected. We conclude that there is some 
relationship (some dependence) between an employee’s performance 
in the training program and his (or her) success in the job. 
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9.6 GOODNESS OF FIT 


We speak of “goodness of fit” when we try to compare an observed 
frequency distribution with the corresponding values of a theoretical 
distribution. 


Example. To illustrate, suppose that during 400 five-minute intervals 
the air-traffic control of an airport received 0, 1, 2,..., and 13 radio 
messages with respective frequencies of 3, 15, 47, 76, 68, 74, 46, 39, 
15, 9, 5, 2, 0, and 1. Suppose, furthermore, that we want to check 
whether these data substantiate the claim that the number of radio 
messages which they receive during a five-minute interval may be 
looked upon as a random variable having the Poisson distribution 
with A = 4.6. Looking up the corresponding Poisson probabilities 
in Table 2 and multiplying them by 400 to get the expected frequencies, 
we arrive at the result shown in the following table together with the 
original data: 





Number of Observed Poisson Expected 
radio messages frequencies probabilities frequencies 
0 3 0.010 4.0 
1 a is 0.046 isl nee 
2 47 0.107 42.8 
3 76 0.163 65.2 
4 68 0.187 74.8 
5 74 0.173 69.2 
6 46 0.132 52.8 
7 39 0.087 34.8 
8 15 0.050 20.0 
9 9 0.025 10.0 
10 5 0.012 4.8 
11 2 0.005 2.0 
12 o( 0.002 oer 
13 1 0.001 0.4 
400 400.0 


Evidently, there are some discrepancies between the observed frequen- 
cies and the expected frequencies, but it remains to be seen whether 
they are significant or whether they can be attributed to chance. 


An appropriate test of the null hypothesis that a set of data comes 
from a population having a given distribution (against the alternative 
that the population has some other distribution) can be based on the 
Statistic 
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y= y CE = é;)” 


l 


where the f, and e, are, as before, the corresponding observed and expected 
frequencies. The sampling distribution of this statistic is approximately 
the chi-square distribution with k — m degrees of freedom, where k is 
the number of terms in the formula for y* and mis the number of quan- 
tities, obtained from the observed data, that are used in calculating the 
expected frequencies. Note that in our example we had to know only the 
total frequency, 400, to calculate the e;, so that m equalled 1. 


Continuation of Example. To satisfy the rule on page 254, according 
to which none of the expected frequencies in a chi-square comparison 
should be less than 5, we shall follow the simple practice of combining 
adjacent classes, the first two and the last four as indicated in the 
table on the preceding page. Thus, we get 


— 10.0)? — 8.0)? 


(18 — 22.4)? (47 — 42.8)? (9 (8 
ans eee: go Fee agg Oe ORD 


4 42 

= 6.749 
Since this is less than 16.919, the value of y2 ); for 10 — 1 = 9 degrees 
of freedom, the null hypothesis cannot be rejected at the 0.05 level 


of significance, and we conclude that the Poisson distribution with 
A = 4.6 provides a good fit. 


EXERCISES 
1. The results of polls conducted two weeks and four weeks before a guberna- 
torial election are shown in the following table: 


Two weeks Four weeks 
before election before election 


For Republican 79 91 
candidate 
For Democtatte 84 66 
candidate 


Undecided 37 43 
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Use the level of significance & = 0.05 to decide whether there has been a 
change in opinion during the two weeks between the two polls. 


2. Suppose that in Exercise 17 on page 260 we had been interested also in how 
many of the tires last more than 30,000 miles, and obtained the results shown 
in the following table: 


Brand A BrandB  BrandC Brand D 
Failed to last 


20,000 miles 26 23 15 32 
Lasted from 20,000 

to 30,000 miles 118 93 116 121 
Lasted more than 56 BA fo Fe 


30,000 miles — 


Use the level of significance & = 0.01 to test the null hypothesis that there is 
no difference in the quality of the four kinds of tires. 


3. A large electronics firm which hires many handicapped workers wants to 
determine whether their handicaps affect such workers’ performance. Use 
the level of significance & = 0.05 to decide on the basis of the sample data 
shown in the following table whether it is reasonable to maintain that the 
handicaps have no effect on the workers’ performance: 


Performance 
Above Below 
average Average average 


Blind 21 64 17 
Deaf 16 49 14 
No handicap 29 93 28 


4. Tests of the fidelity and the selectivity of 190 radio receivers produced the 
results shown in the following table: 


Fidelity 
Low Average High 


Low 6 12 32 
Selectivity Average 33 61 18 
High 13 15 0 


Use the level of significance & = 0.01 to show that there is, indeed, a relation- 
ship (dependence) between fidelity and selectivity. 
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5. A quality control engineer takes daily samples of four tractors coming off an 
assembly line and on 200 consecutive working days he obtains the data 
summarized in the following table: 


Number 
requiring Number 
adjustments of days 
0 101 
1 79 
2 19 
3 1 


To test the claim that 10 percent of all the tractors coming off this assembly 
line require adjustments, look up the corresponding probabilities in Table 1, 
calculate the expected frequencies, and perform the chi-square test at the 
level of aignificance & = 0.01. 


6. With reference to Exercise 5, verify that the mean of the given distribution is 
0.60, corresponding to 15 percent of the tractors requiring adjustments. Then 
look up the probabilities for n = 4 and p = 0.15 in Table 1, calculate the 
expected frequencies, and then test at the level of significance & = 0.01 
whether the binomial distribution with n = 4 and p = 0.15 provides a sui- 
table model for this situation. 


7. Suppose that in the example on page 264 we had shown first that the mean of 
the distribution, rounded to one decimal, is 4.5, and then tested whether the 
Poisson distribution with 2 = 4.5 provides a good fit. What would have 
been the number of degrees of freedom for the appropriate chi-square criter- 
ion? 


8. The following is the distribution of the daily number of power failures 
reported in a western city on 300 days: 


Number of Number of 

power failures days 
0 9 
1 43 
2 64 
3 62 
4 42 
5 36 
6 22 
7 14 
8 6 
9 2 


Test at the level of significance & = 0.05 whether the daily number of power 
failures in this city may be looked upon as a random variable having the 
Poisson distribution with A = 3.2. 
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9. 


10. 


11. 


The following is the distribution of the hourly number of trucks arriving at 
a company’s warehouse: 


Trucks arriving 
per hour Frequency 


D2 
151 
130 
102 

45 

12 

3 
1 
2 


COND UN RWN kK CO 


Find the mean of this distribution, and using its mean (rounded to one 
decimal) as the parameter J, fit a Poisson distribution. Test for goodness of 
fit at the level of significance & = 0.05. 


Using any four columns of Table 7 (that is, a total of 200 random digits), 
construct a table showing how many times each of the digits 0, 1,..., and 9 
occurred. Compare these observed frequencies with the corresponding 
expected frequencies (based on the assumption that the digits are randomly 
generated) by means of the chi-square statistic, and test at the level of sig- 
nificance & = 0.05 whether the assumption of randomness is tenable. 


The following is the distribution of the sulfur oxides emission data obtained 
on page 137: 


Class limits 


(tons) Frequency 
5.0- 8.9 3 
9.0-12.9 10 

13.0-16.9 14 
17.0-—20.9 25 
21.0—24.9 17 
25.0-28.9 9 
29.0-32.9 2 

80 


As we showed on page 155, the mean of this distribution is x = 18.85 and 

its standard deviation is s = 4/30.77 = 5.55. 

(a) Find the probabilities that a random variable having a normal distribu- 
tion with 4 = 18.85 and o = 5.55 takes on a value between 4.95 and 
8.95, between 8.95 and 12.95, between 12.95 and 16.95, between 16.95 
and 20.95, between 20.95 and 24.95, between 24.95 and 28.95, and be- 
tween 28.95 and 32.95. 
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12, 


(b) Multiply the probabilities obtained in part (a) by the total frequency (in 
this case n = 80), thus getting the expected normal curve frequencies 
corresponding to the seven classes of the given distribution. 

(c) Test the null hypothesis that the given data may be looked upon as a 
random sample from a normal population by comparing the observed 
and expected frequencies with an appropriate Y? statistic (using & = 
0.05). Explain why the number of degrees of freedom for this 7¥?2 test is 
given by k — 3, where k is the number of terms in the ¥? statistic. 


Among 100 vacuum tubes used in an experiment, 46 had a service life of less 
than 20 hours, 19 had a service life of 20 or more but less than 40 hours, 17 
had a service life of 40 or more but less than 60 hours, 12 had a service life of 
60 or more but less than 80 hours, and 6 had a service life of 80 hours or 
more. Using steps similar to those outlined in Exercise 11, test whether these 
lifetimes may be regarded as a sample from an exponential population with 
jt = 40 hours. Use the level of significance & = 0.01. 





Nonparametric 
Methods 





10.1 INTRODUCTION 


Most of the methods of inference which we have studied so far are based 
on the assumption that the observations are taken from normal popula- 
tions. These methods extract all the information that is available in a 
sample, and they usually attain the best possible precision, that is, the most 
reliable results. As we have pointed out earlier, the assumption that the 
samples are taken from normal populations is not really as stringent as it 
may seem. Most statistical methods based on the normal distribution are 
fairly robust, that is, they will give reasonably accurate answers even when 
the normality assumption is only satisfied in an approximate sense. In 
spite of this, there are several reasons why we may wish to use other, less 
precise methods—the assumption of normality may be grossly incorrect, 
the labor involved in carrying out the more precise methods may be exces- 
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sive, or a short-cut method may be desired to determine in advance 
whether it is worthwhile to carry out the more detailed calculations. 


Several short-cut methods have already been introduced, primarily as 
labor-saving devices. For example, on page 145 we introduced a quick 
method of estimating the mean and the standard deviation of normally 
distributed data, using the 50 percent and 84 percent points on a “prob- 
ability” graph. On page 154 we discussed a method of coding which can 
materially reduce the time required to calculate the mean and the standard 
deviation of a set of data without any loss in precision. In Chapter 5 we 
observed that these calculations can further be simplified with only a minor 
loss in precision by grouping the observations into a frequency table. 

Other short-cuts connected with point estimation were introduced in 
Chapters 7 and 8. We discussed the sample median, which often can be deter- 
mined more easily and more rapidly than the mean, and, as was pointed out 
on page 181, the median gives an unbiased estimate for the mean of a sym- 
metric population. Also, the median is superior to the mean as a measure 
of “location” of a highly skewed population. The sample range was intro- 
duced on page 228 as an estimator for the standard deviation of a normal 
population; it is obtained far more quickly than the sample standard 
deviation, and its precision is nearly that of s for small samples. 

If there is a choice between several statistical methods which can all be 
used in a given situation, the criterion of efficiency is most commonly used 
as an appropriate guide. If we think of the methods based on the assump- 
tion of normality as being fully (100 percent) efficient, we can use this as 
a yardstick for measuring the merits of any other method. The most widely 
used measure of efficiency is based on the sample sizes required to give 
equally precise results by a given method and by the corresponding method 
which is fully efficient. For example, in estimating the mean of a normal 
population, the most efficient method involves use of the sample mean <x. 
If we wanted to use the median instead of the mean, we would have to 
take into account that the variance of the sampling distribution of the 


2 
median is approximately 1.57 - so that the efficiency of the median is 


1/1.57 or approximately 64 percent. In other words, the median based on 
a sample of size 100 gives as reliable an estimate of the mean of a normal 
population as the mean based on a sample of size 64. Similarly, it can be 
shown that the efficiency of the range estimator for the standard deviation 
of anormal population decreases as the sample size increases; the efficiency 
is 100 percent for samples of size 2, 96 percent for samples of size 5, and 
81 percent for samples of size 15. 

Certain methods of inference have the important advantage that they 
do not require the stringent assumptions of the methods based on the 
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normal distribution, and they usually have the additional advantage that 
they require less burdensome calculations. They include nonparametric 
methods as well as distribution-free methods, where the first term applies 
when we are not concerned directly with the parameters of populations 
of a given kind, and the second term applies when we make no assumptions 
about the populations from which we are sampling, except perhaps that 
they must be continuous. Actually, since this distinction is rather fine, it 
has become the custom to refer to either kind of method simply as non- 
parametric. 

The main advantage of nonparametric methods is that exact inferences 
can be made when the assumptions underlying so-called “standard” 
methods cannot all be met. The main disadvantage of nonparametric 
methods is that they may be wasteful of information, and usually have a 
smaller efficiency than the corresponding parametric methods provided 
that the assumptions of the standard (parametric) methods can be met. Thus, 
if we say that the efficiency of a certain nonparametric method is 80 per- 
cent, we may actually be understating its relative worth, for the efficiency 
of the corresponding “standard” method will be somewhat less than 100 
percent if all assumptions are not exactly met. 

In this chapter we shall outline a variety of useful methods, which can 
be used in place of the corresponding standard methods (described in 
Chapters 7 through 9) whenever the assumptions are not met or there is a 
need for greater ease in calculations. Certain other nonparametric methods, 
which can be used in place of “standard” methods not yet introduced, will 
be desefibed along with the new methods in subsequent chapters. 


10.2 THE SIGN TEST 


In this section we shall describe nonparametric tests that are based on 
classifying the data according to two attributes, conveniently represented 
by plus signs and minus signs. For instance, in the one-sample case we can 
test the null hypothesis 4 = My, on the basis of a random sample of size n, 
by replacing each observation exceeding “, with a plus sign and each 
observation exceeded by yw, with a minus sign. If the population from 
which we are sampling is continuous and symmetrical, the probability that 
an observation is thus replaced by a plus sign equals 4 when H, is true. 
Consequently, the test of the null hypothesis uw = 4, becomes equivalent 
to a test of the null hypothesis p = 4, where p is the parameter of a bino- 
mial distribution. The two-sided alternative “4 +4 MU, is now equivalent to 
p # i, and the one-sided alternatives uw < uy and “> My are equivalent 
to p <4 and p > 4, respectively, where p is the probability of getting a 
plus sign, namely, an observation greater than sy. (If a sample value 
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happens to equal “4, which is possible since the values of continuous 
random variables are virtually always rounded, it is simply discarded.) 


Example. Let us consider the following data, which are the octane 
ratings obtained for 15 samples of a certain kind of gasoline: 


101.0 103.3 101.8 102.5 101.7 98.2 101.1 104.5 
105.3. 99.4 102.4 100.9 100.3 100.0 103.6 


To test the null hypothesis that the true mean octane rating of this kind 
of gasoline is “4 = 100.0 against the alternative hypothesis uw > 100.0 
at the level of significance « = 0.05, we first replace each value Jess 
than 100.0 with a minus sign, each value greater than 100.0 with a plus 
sign, and we discard the one value which actually equalled 100.0. Thus, 
we get 


++ ++ + —- + + 4-4 4+ 4+ 4 


and it remains to be seen whether “12 successes in 14 trials” supports 
the null hypothesis p = 4 or the alternative hypothesis p > 4. Applying 
the exact test criterion given on page 241 with « = 0.05, we find from 
Table 1 that ko.9; = 10; since there were 12 plus signs, it follows that 
the null hypothesis will have to be rejected. (More directly, we could 
have argued that the probability of “12 or more successes in 14 trials” 
is only 0.0065 according to Table 1, so that the null hypothesis must be 
rejected.) We conclude that the mean octane rating of the given kind 
of gasoline is greater than 100.0. 


In this example, we had to refer directly to Table 1, but if the sample size 
is sufficiently large, we can use the normal curve approximation tothe 
binomial distribution and the tests given in the table on page 212. 

The sign test has important applications in problems where we are 
dealing with paired data, so that each pair can be replaced with a plus sign 
if the first value is greater than the second, a minus sign if the first value is 
smaller than the second, or be discarded if the two values are equal. As in 
Section 7.8 such problems arise in two kinds of situations, depending on 
whether the data are actually given as pairs as in the example on page 220, 
or whether they consist of independent random samples whose values are 
randomly paired. 


Example. To illustrate the first case, let us refer to the example on 
page 220, which dealt with the effectiveness of an industrial safety 
program. As can easily be verified the data for the 12 plants are 
replaced by 


+++ 4-4 - 4 - 4+ 4+ 4 
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and it remains to be seen whether “9 successes in 12 trials” enables us 
to reject the null hypothesis p = 4 against the alternative hypothesis 
p > tat the level of singificance « = 0.05. Since the probability of “9 
or more successes” is 0.0730 according to Table 1, it follows that the 
null hypothesis cannot be rejected. In other words, we cannot conclude 
that the industrial safety program is effective, and this illustrates the 
point made on page 272 that nonparametric methods can be wasteful 
of information. When we performed the ¢ test on page 220, we were 
able to conclude at the level of significance « = 0.05 that the industrial 
safety program is effective. 


Example. To illustrate the second case, where we deal with two inde- 
pendent samples, let us consider the following data pertaining to the 
noise level (in decibels) recorded at two busy intersections during early 
morning traffic: 


Intersection A: 69, 74, 77, 59, 80, 59, 71, 65, 62, 79, 
76, 60, 59, 64, 71, 63, 63, 67, 62, 71 


Intersection B: 59, 78, 53, 63, 67, 63, 59, 58, 64, 74, 
66, 62, 68, 68, 71, 70, 56, 55, 63, 68, 
i ae 


Note that there are two more values in the second sample, so that if we 
randomly match one value of the second sample with each value of the 
first sample, two values of the second sample will be left over. Actually, 
doing this with the use of random numbers, we obtain the pairs shown 
in the following table, where we indicated by means of a plus sign that 
the value for Intersection A is larger, and by means of a minus sign that 
the value for Intersection A is smaller: 


Intersection A Intersection B Intersection A Intersection B 
62 55 + 80 63 - 
71 68 + 64 71 — 
59 67 _ 71 59 + 
63 23 “- 69 63 ots 
77 63 + 63 58 ++ 
67 59 ++ 74 70 ++ 
79 64 -+ 71 78 — 
62 68 — 59 53 “ 
59 56 + 76 66 + 
60 74 -- 65 62 +- 


There are 15 plus signs and 5 minus signs, and we shall have to see 
whether “15 successes in 20 trials” supports the null hypothesis p = 4 
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(namely, the null hypothesis that the true average noise level at the two 
intersections is the same) or the alternative hypothesis p + 4. Since the 
probability of “15 or more successes” is 0.0207 according to Table 1, 
we find that the null hypothesis can be rejected at the level of signifi- 
cance «& = 0.05, but not at the level of significance « = 0.01. Note that 
if we had used the normal approximation of the binomial distribution, 
substitution of n = 20 and p = 4 would have yielded w = 20(4) = 10, 
o = /20(5)(5) = 2.236, and 


—_ 145-10 | 
Sale, oT eee 2.01 
and the result would have been exactly the same. Note that since 20 is 
small we made the continuity correction suggested in the footnote on 
page 252. 


10.3. RANK-SUM TESTS 


The paired-sample sign test is one of several nonparametric methods 
for testing the null hypothesis that two samples come from identical popu- 
lations against the alternative hypothesis that the populations have unequal 
means. A highly efficient class of nonparametric tests of this and similar 
hypotheses is based on rank sums; that is, the observations are assigned 
ranks according to their order of magnitude, and the tests are performed 
on the basis of certain sums of these ranks. In this section we shall intro- 
duce three tests based on rank sums. The Mann-Whitney U test will be 
presented as a substitute for the two-sample f¢ test, and it has a limiting 
efficiency of 95.5 percent when the assumptions underlying the corre- 
sponding f¢ test are satisfied. A test similar to the U test, which can be used 
when the alternative hypothesis specifies that the two populations have 
unequal dispersions, will be taken up next. Finally we shall introduce the 
Kruskal-Wallis H test, for testing whether k samples come from identical 
populations against the alternative that the populations have unequal 
means. Like the U test, the H test also has a limiting efficiency of 95.5 per- 
cent when compared with the corresponding “standard” procedure, which 
will be discussed in Chapter 12. 

Let us first describe the Mann-Whitney U test, a very popular test, by 
means of the following illustration: 


Example. In a study of sedimentary rocks, the following diameters 
(in millimeters) were obtained for samples of 29 grains from two kinds 
of sand: 
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SandI: 0.63, 0.17, 0.35, 0.49, 0.18, 0.43, 0.12, 0.20, 
0.47, 1.36, 0.51, 0.45, 0.84, 0.32, 0.40 

Sand IT: 1.13, 0.54, 0.96, 0.26, 0.39, 0.88, 0.92, 0.53, 
1.01, 0.48, 0.89, 1.07, 1.11, 0.58 


The means of these two samples are 0.46 and 0.77, respectively, and 
the problem is to decide whether their difference is significant. Note 
that we may well be reluctant to use the standard test of Section 7.8 
since the first sample shows considerably more variability than the 
second; its values range from 0.12 to 1.36, whereas those of the second 
sample range only from 0.26 to 1.13. 

To perform the Mann-Whitney test, we first rank the data jointly 
in an increasing (or decreasing) order of magnitude, and for our data 
we, thus, obtain the following array: 


0.12 0.17 0.18 0.20 0.26 0.32 0.35 0.39 0.40 0.43 
I I I I II I I II I I 


0.45 0.47 0.48 0.49 0.51 0.53 0.54 0.58 0.63 0.84 
I I II I I II II II I I 


0.88 0.89 0.92 0.96 1.01 1.07 1.11 1.13 1.36 
I II II II II II II II I 


Assigning the data in this order the ranks 1, 2, 3,..., and 30, we find 
that the values of the first sample (Sand I) occupy ranks 1, 2, 3, 4, 6, 7, 
9, 10, 11, 12, 14, 15, 19, 20, and 29, while those of the second sample 
(Sand II) occupy ranks 5, 8, 13, 16, 17, 18, 21, 22.23; 24, 25, 26,21, And 
28. There are no ties in this example among values belonging to dif- 
ferent samples, but if there were, we would assign to each of the tied 
observations the mean of the ranks which they jointly occupy. (Thus, if 
the third and fourth values were identical, we would assign each the 
3 > = 3.5: if the ninth, tenth, and eleventh values were identi- 


cal, we would assign each the rank a = 10, and so forth.) 


rank 





The null hypothesis we shall want to test is that both samples come 
from identical populations, and it stands to reason that in that case the 
means of the ranks, or the sums of the ranks assigned to the values of 
the two samples, should be more or less the same. For our two samples, 
the sums of the ranks are, respectively, 162 and 273, and it remains to 
be seen whether their difference is large enough to reject the null 
hypothesis. 


In the Mann-Whitney test, we actually base this decision on the statistic 
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U = n,n, + n+) _p, 


where 7, and n, are the respective sizes of the first and second samples, and 
R, is the sum of the ranks assigned to the values of the first sample. (In 
practice, we can designate the sample whose rank sum is most easily 
obtained as the “first sample,” as it is immaterial which sample is referred 
to as “first.”) 

Under the null hypothesis that the two samples come from identical 
populations, it can be shown that the mean and the variance of the sam- 
pling distribution of the U statistic are given by 





nn (nN, + nN, + 1) 
12 


ly = 5 and 07 = 





Note that if there are ties in rank, these formulas provide only approxima- 
tions, but if the number of ties is small,such approximations will generally 
be good. Furthermore, if both n, and n, are sufficiently large (in this case, 
greater than 8), the sampling distribution of the U statistic can be approxi- 
mated closely by a normal distribution and, hence, the test can be based 
on the statistic 





and Table 3. There also exist tables on which exact tests can be based 
when 7, and n, are small, and they may be found, for example, in the book 
by D. B. Owen mentioned in the Bibliography. 


Continuation of Example. Returning now to our numerical example, 
we haven, = 15, n, = 14, R; = 162, and, hence, 


ie 15-14 + 216 _ 162 = 168 


15414 
= 5 = 105 


gi = 15:14:30 _ 595 
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Thus, 


168 — 105 


La 2.74 


Z 


and since this value exceeds Z 995 = 2.58, the upper critical value for a 
two-sided alternative hypothesis and « = 0.01, we find that the null 
hypothesis will have to be rejected. We conclude that there is a difference 
between the true average grain size of the two kinds of sand. 


An interesting feature of the Mann-Whitney test is that, with a slight 
modification, it can also be used to test the null hypothesis that two sam- 
ples come from identical populations against the alternative that the two 
populations have unequal dispersions, namely, that they differ in variability 
or spread. As before, the values of the two samples are arranged jointly in 
an increasing (or decreasing) order of magnitude, but now they are ranked 
from both ends toward the middle. We assign rank 1 to the smallest value, 
ranks 2 and 3 to the largest and second largest values, ranks 4 and 5 to the 
second and third smallest, ranks 6 and 7 to the third and fourth largest, 
and so on. Subsequently, the calculation of U and the performance of the 
test are the same as before. The only difference is that with this kind of 
ranking a small rank sum tends to indicate that the population from which 
the sample was obtained has a greater variation than the other, because its 
values occupy the more extreme positions. We shall not attempt to illus- 
trate this technique with reference to our grain-size example, since this 
test loses its sensitivity for detecting differences in variability when the 
means of the populations are not the same. The reader will be asked to use 
it, though, in Exercises 13 and 14 on page 281. 

The Kruskal-Wallis H test for deciding whether k independent samples 
come from identical populations is conducted in a way similar to the U 
test. As before, the observations are ranked jointly, and if R; is the sum of 
the ranks occupied by the n,; observations of the ith sample, the test 1s 
based on the statistic 





where n =n, +n,+...-+n,. When n; > 5 for all i and the null hy- 
pothesis holds, the distribution of the H statistic is well approximated by 
the chi-square distribution with k — 1 degrees of freedom. Special tables 
applying for selected small values of the n; and k are referred to in the 
table by D. B. Owen mentioned in the Bibliography. 
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Example. To illustrate the Kruskal-Wallis H test, suppose that an 
experiment, designed to compare three preventive methods against cor- 
rosion, yielded the following maximum depths of pits (in thousandths 
of an inch) in pieces of wire subjected to the respective treatments: 


Method A: 77, 54, 67, 74, 71, 66 
Method B: 60, 41, 59, 65, 62, 64, 52 
Method C: 49, 52, 69, 47, 56 


If we rank these measurements jointly from smallest to largest, we find 
that those in the first sample are assigned the ranks 6, 13, 14, 16, 17, 
and 18, so that R, = 84; those in the second sample are assigned the 
ranks 1, 4.5, 8, 9, 10, 11, and 12, so that R, = 55.5; and those in the 
third sample are assigned the ranks 2, 3, 4.5, 7, and 15, so that R; = 
31.5. Substituting into the formula for H we thus obtain 


eee - 55,5? se 


~ 18-19\ 6~ 7 5 
67 


and, comparing this figure with 5.991 (the value of v3.,; for 2 degrees 
of freedom), we find that the null hypothesis will have to be rejected. 
Thus, the three preventive methods against corrosion are not equally 
effective. 


EXERCISES 


. With reference to the ignition times (of certain upholstery materials) of 
Exercise 6 on page 147, use the sign test to test the null hypothesis “ = 6.50 
seconds against the alternative hypothesis 44 < 6.50 seconds at the level of 


ignificance & = 0.01. 

a a laboratory experiment, 18 determinations of the coefficient of friction 
between leather and metal yielded the following results: 0.59, 0.56, 0.49, 0.55, 
0.65, 0.55, 0.51, 0.60, 0.56, 0.47, 0.58, 0.61, 0.54, 0.68, 0.56, 0.50, 0.57, and 
0.53. Use the sign test at the level of significance & = 0.05 to test the null 
hypothesis uw = 0.55 against the alternative hypothesis uw ~ 0.55. 

f Applying the sign test to the data of Exercise 9 on page 147, the one pertain- 
ing to the boiling point of a certain silicon compound, test the null hypothe- 


sis 4 = 158 (degrees centigrade) against the alternative hypothesis z+ 158. 
Use thetével of significance & = 0.05. 


4, Phe quality control department of a large manufacturer obtained the fol- 
lowing sample data (in pounds) on the breaking strength of a certain kind of 
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2-inch cotton ribbon: 153, 159, 144, 160, 158, 153, 171, 162, 159, 137, 159, 

159, 148, 162, 154, 159, 160, 157, 140, 168, 163, 148, 151, 153, 157, 155, 148, 

168, 162, and 149. Use the sign test at the level of significance & = 0.01 to 
fun the null hypothesis 4 = 150 against the alternative hypothesis uw > 150. 
5 


With reference to the data of Exercise 18 on page 224, use the sign test at the 
level of significance & = 0.05 to check the claim that Method B (of instruct- 
ing the trainees) is more effective than Method A. 


6. With reference to the data of Exercise 19 on page 225, use the sign test at the 
evel of significance & = 0.10 to check whether there is a systematic differ- 
ence between weights obtained with the two scales. 


. With reference to Exercise 20 on page 225, use the sign test and the level of 
significance & = 0.01 to check whether there really is a difference in the 
hardness of the two kinds of magnesium alloys. 


8. The following are the number of speeding tickets issued by two policemen on 
17 days: 7 and 10, 11 and 13, 14 and 14, 11 and 15, 12 and 9, 6and 10, 9 and 
13, 8 and 11, 10 and 11, 11 and 15, 13 and 11, 7 and 10, 8 and 8, 11 and 12, 
9 and 14, 10 and 9, 13 and 16. Use the sign test at the level of significance 
& = 0.05 to test the null hypothesis that on the average the two policemen 
issie equally many speeding tickets per day against the alternative hypothesis 

at on the average the second policeman issues more speeding tickets per 
day than the first. 


. The following are the number of minutes it took a sample of 15 men and 12 
women to complete the application form for a position: 


Men: 16.5, 20.0, 17.0, 19.8, 18.5, 19.2, 19.0, 18.2, 20.8, 
18.7, 16.7, 18.1, 17.9, 16.4, 18.9 
Women: 18.6, 17.8, 18.3, 16.6, 20.5, 16.3, 19.3, 18.4, 19.7, 
18.8, 19.9, 17.6 


Use the Mann-Whitney test at the level of significance & = 0.05 to test the 
null hypothesis that the two samples come from identical populations against 
the alternative that the two populations have unequal means. 


10. Referring to Exercise 20 on page 225, use the Mann-Whitney test at the level 
significance & = 0.05 to test whether the true average hardness of the two 
lloys is the same (see also Exercise 7 above). 


1¥. Comparing two kinds of emergency flares, a consumer testing service obtain- 
ed the following burning times (rounded to the nearest tenth of a minute): 


Brand C; 19.4, 21.5, 15.3, 17.4, 16.8, 16.6, 20.3, 22.5, 
Dlvdy Daeg £9.15 2h 

Brand D; 16.5, 15.8, 24.7, 10.2, 13.5, 15.9, 15.7, 14.0, 
12.1, 17.4, 15.6, 15.8 


Use the Mann-Whitney test and a level of significance of 0.01 to check 
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whether it is reasonable to say that there is no difference between the true 
average burning times of the two kinds of flares. 


12. To find the best arrangement of instruments on a control panel of an air- 
plane, two different arrangements were compared by simulating an emer- 
gency condition and measuring the reaction time required to correct the 
condition. The reaction times (in tenths of a second) of twenty pilots (ran- 
domly assigned to the two different arrangements) were as follows: 


Arrangement I: 8, 15, 10, 13, 17, 10, 9, 11, 12, 15 
Arrangement 2: 12, 7, 13, 8, 14, 6, 16, 7, 10, 9 


Use the Mann-Whitney test at the level of significance & = 0.05 to check the 
claim that the second arrangement is better. 


“With reference to Exercise 9, use the Mann-Whitney test (modified as sug- 
gested on page 278) to test the null hypothesis that the two samples come 
from identical populations against the alternative hypothesis that the popu- 
lations have unequal dispersions. Use the level of significance & = 0.05. 


14. The following are the weekly food expenditures (in dollars) of ten families 
with two children chosen at random from two suburbs of a large city: 


Suburb A: 54.78, 62.60, 51.89, 54.50, 56.00, 59.38, 48.19, 
70.45, 55.15, 51.95 

Suburb B: 50.12, 44.63, 64.91, 72.16, 74.59, 39.35, 52.76, 
78.19, 45.75, 68.72 


Use the Mann-Whitney test (modified as suggested on page 278) to test the 

null hypothesis that the two samples come from identical populations 

against the alternative that the two populations have unequal dispersions. 
se the level of significance & = 0.05. 


. The following are the number of misprints counted on pages selected at 
random from three Sunday editions of a newspaper: 


April ll: 4,10, 2, 6, 4, 12 
April 18: 8, 5, 13, 8, 8, 10 
April 25: 7,9, 11, 2, 14, 7 


Use the Kruskal-Wallis test at the level of significance & = 0.05 to test the 

null hypothesis that the three samples come from identical populations 

against the alternative that the compositors and/or proofreaders who worked 
_ ei fon the three editions are not equally good. 





( . ‘The following are the miles per gallon which a test driver got for ten tankfuls 
each of three brands of gasoline: 


Brand 1: 22, 25; 32, 18,23, 15, 30,27, 19,23 
Brand 2; 19; 22, 18, 29, 28, 32, 17, 33, 28, 20 
Brand 3: 30, 29, 25, 24, 15, 27, 30, 27, 18, 32 
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Test at the level of significance «& = 0.05 whether there is a difference in the 
true average performance of the three brands of gasoline. 


17. So-called Franklin tests were performed to determine the insulation proper- 
ties of grain-oriented silicon steel specimens that were annealed in five dif- 


ferent atmospheres with the following results: 





Atmosphere Test results (amperes) 
1 0.58 0.61 0.69 0.79 0.61 0.59 
2 0.37 0.37 0.58 0.40 0.28 0.44 0.35 
3 0.29 0.19 0.34 0.17 0.29 0.16 
4 0.81 0.69 0.75 0.72 0.68 0.85 0.57 0.77 
5 0.26 0.34 0.29 0.47 0.30 0.42 


Use the Kruskal-Wallis H test and a level of significance of 0.05 to decide 
whether these five samples can be assumed to come from identical popu- 


lations. 


18. A panel of seven experts was asked to rate each of five industries on the like- 
lihood that technological changes would produce improvement in environ- 
mental pollution over the next ten years. Their ratings (in the form of judg- 


mental probabilities) are as follows: 


Expert 


NAYNA NR WN 


A 


0.15 
0.30 
0.20 
0.00 
0.10 
0,25 
0.40 


B 


1 
0.60 
0.80 
0.50 
0.55 
0.70 
0.95 


Industry 


Cc 


0.10 
0.20 
0.30 
0.25 
0.15 
0.35 
0.45 


D 


0.00 
0.05 
0.00 
0.10 
0.15 
0,29 
0.20 


E 


0.30 
0.25 
0.50 
0.60 
0.40 
0.45 
0.35 


(a) Test whether the average scores given by the experts are significantly 
different (0.05 level of significance). 

(b) Test whether the average scores given to the five industries are signifi- 
, cantly different (0.01 level). What argument(s) can you make against the 
validity of this test? 


<A0.4 A TEST OF RANDOMNESS 


When we discussed random sampling in Chapter 6, we gave several 
methods which provide some assurance in advance that a sample taken will 
be random. Nevertheless, it is useful to have a technique for testing 
whether a sample may be looked upon as random after it has actually been 
obtained. One such technique is based on the order in which the sample 
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values were obtained; more specifically, it is based on the number of runs 
exhibited in the sample results. 

Given a sequence of two symbols, such as H and T (which might repre- 
sent the occurrence of heads and tails in repeated tosses of a coin), a run 
is defined as a succession of identical symbols contained between different 
symbols or none at all. For example, the sequence 


TTHHTTHHHTHHHTTTTHHHE 


contains 8 runs, as indicated by the underlines. The total number of runs 
in a sequence of 7 trials often serves as an indication that the arrangement 
is not random. For instance, if there had been only two runs consisting of 
ten heads followed by ten tails, we might have suspected that the prob- 
ability of a success did not remain constant from trial to trial. On the other 
hand, had the sequence of twenty tosses consisted of alternating heads and 
tails, we might have suspected that the trials were not independent. In 
either case, there are grounds to suspect a lack of randomness. Note that 
our suspicion is not aroused by the number of H’s and T’s, but by the 
order in which they appeared. 

If a sequence contains , symbols of one kind and n, of another kind 
(and neither 1, nor n, is less than 10), the sampling distribution of the 
total number of runs, u, can be approximated closely by a normal distribu- 
tion with the mean and the standard deviation 


L, = — 


_2nyN,_ r ] and a: _ = ,/[ AemCnin, a ny, = N>) 


ny +i (ny + N2)?*(M, + Nz — 1) 





Thus, the test of the null hypothesis that the arrangement of the symbols 
(and, hence, the sample) is random can be based on the statistic 





and Table 3. Special tables for performing exact tests when 7,, n,, or both 
are small may be found in the tables by D. B. Owen listed in the Bibliog- 
raphy. 


Example. To illustrate this test, let us consider the following arrange- 
ment of defective, d, and nondefective, n, pieces produced in the given 
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order by a certain machine: 


nnnnnddddnnnnnnnnnnddnndddd 


—_— ee , —e_C—_- — 


Since there aren, = 10 d’s, n, = 17 n’s, and u = 6 runs, substitution 
into the formulas yields 





2-10-17 
= [== 13.59 
Ky 10 a 17 + 
..  fd+10-TH2+ 10217 — 10 — TY) _. 
oa Ae Aes IF iy. ao 
and 
_6—13.59 
Since this value is less than —Zy 995 = —2.58, we can reject the null 


hypothesis of randomness at the level of significance « = 0.01. The 
total number of runs is much smaller than expected and there is a strong 
indication that the defective pieces appear in clusters or groups; the 
reason for this will have to be uncovered by an engineer who is familiar 
with the process. 


The run test can be used also to test the randomness of samples con- 
sisting of numerical data by counting runs above and below the median. 
Denoting an observation exceeding the median of the sample by the letter 
a and an observation less than the median by the letter b, we can use the 
resulting sequence of a’s and b’s to test for randomness by the method just 
indicated. A frequent application of this test is in quality control, where 
the means of successive small samples are exhibited on a graph in chrono- 
logical order. The run test can then be used to check whether there might 
be a trend in the data, so that it is possible to adjust a machine setting or 
some other process variable before any serious damage occurs. 


Example. To illustrate this kind of test, suppose that an engineer is 
concerned about the possibility that too many changes are being made 
in the settings of an automatic lathe. To test this hypothesis at a = 
0.01, the following mean diameters (in inches) are obtained for 40 
successive shafts turned on the lathe: 


0.261 0.258 0.249 0.251 0.247 0.256 0.250 0.247 0.255 0.243 
0.252 0.250 0.253 0.247 0.251 0.243 0.258 0.251 0.245 0.250 
0.248 0.252 0.254 0.250 0.247 0.253 0.251 0.246 0.249 0.252 
0.247 0.250 0.253 0.247 0.249 0.253 0.246 0.251 0.249 0.253 
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The median of these measurements is 0.250, and, replacing each by the 
letter a if it exceeds 0.250, by the letter 5 if it is less than 0.250, and 
omitting the five which equal 0.250, we obtain the sequence 


aababababaababaabbaabaabbababbababa 
having 27 runs. Thus, 7, = 19,, = 16, wu = 27, and we obtain 


= a oe + i= [e357 


_p 19- 16(2- 19-16 — 19 — 16) __ 5 go 
(19 + 16)2(19 + 16 — 1) 


and 


DF IAT... 


Since this value exceeds z, 5, = 2.33, we can reject the null hypothesis 
that the sequence of measurements is random. Since the number of 
runs is larger than one might expect due to chance, it is reasonable to 
suppose that the lathe is being adjusted too often; it is probably ad- 
justed after each rod is turned, to compensate for any observed dis- 
crepancy from a nominal diameter of 0.250 inch. 


Pal / 


193TH KOLMOGOROV-SMIRNOV TESTS 


The Kolmogorov-Smirnov tests are nonparametric tests for differences 
between cumulative distributions. The one-sample test concerns the agree- 
ment between an observed cumulative distribution of sample values and a 
specified continuous distribution function; thus, it is a test of goodness of 
fit. The two-sample test concerns the agreement between two observed 
cumulative distributions; it tests the hypothesis whether two independent 
samples come from identical continuous distributions, and it is sensitive 
to population differences with respect to location, dispersion, or skewness. 

The Kolmogorov-Smirnov one-sample test is generally more efficient 
than the chi-square test for goodness of fit for small samples, and it can be 
used for very small samples where the chi-square test does not apply. It 
must be remembered, however, that the chi-square test of Section 9.6 can 
be used in connection with discrete distributions whereas the Kolmogorov- 
Smirnov test cannot. 

The one-sample test is based on the maximum absolute difference D 
between the values of the cumulative distribution of a random sample of 
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size n and a specified theoretical distribution. To determine whether this 


difference is larger than can reasonably be expected, we look up the critical 
value of D in Table 10. 


Example. To illustrate this test, suppose it is desired to check whether 
pinholes in electrolytic tin plate are distributed uniformly across a 
plated coil on the basis of the following distances (in inches) of 10 pin- 
holes from one edge of a long strip of tin plate 30 inches wide: 


4.8 14.8 28.2 23.1 44 28.7 19.5 2.4 25.0 6.2 


Under the null hypothesis that the pinholes are uniformly distributed, 
the theoretical cumulative distribution with which we want to compare 
the observed cumulative distribution is given by 


0 forx <0 


F(x) = x for0 <x < 30 


l for x > 30 


The graph of this theoretical cumulative distribution, as well as that of 
the observed cumulative distribution, is shown in Figure 10.1. As 





Figure 10.1. Kolmogorov-Smirnov test. 
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indicated in this diagram, the maximum difference between the two 
cumulative distributions is 0.193, and since this is less than D, 9; 
= 0.410 (the critical value of D for n = 10 and a = 0.05), it follows 
that the null hypothesis (that the pinholes are uniformly distributed) 
cannot be rejected. 


The two-sample Kolmogoroy-Smirnov test is based on the maximum 
absolute difference between the values of the two observed cumulative 
distributions. In principle, it is very similar to the one-sample test, and 
the necessary critical values can be obtained from special tables (for exam- 
ple, those by D. B. Owen referred to in the Bibliography). 


EXERCISES 


(ir he following arrangement indicates whether sixty consecutive cars which 
“went by the toll booth of a bridge had local plates, L, or out-of-state plates, 
O: 
LLOLLLLOOLLLLOLOOLLLLOLOOLLELL 
(cont) OLLLOLOLLLLOOLOOOOLLLLOLOOLLLO 


Use the level of significance & = 0.05 to test whether this arrangement of 
’s and O’s may be regarded as random. 






o check the randomness of the digits in Table 7, take those in any five 
rows, replace the odd digits by the letter O and the even digits by the letter 
E, and base your decision on the total number of runs in the resulting 
/sequence of O’s and E’s. Use the level of significance & = 0.05. 


B To test whether radio signals from deep space contain a message, an interval 

"of time could be subdivided into a number of very short intervals and it 
could then be determined whether the signal strength exceeded a certain 
level (background noise) in each short interval. Suppose that the following is 
part of such a record, where H denotes a high signal strength and L denotes 
that the signal strength does not exceed a given noise level. 


LLHLHLHLHHHLHHALAAALALALALLAL 
LLHLHLHLHHHLHHALHHALALALALLLE 


Test this sequence for randomness (using the 0.05 level of significance) and 
ascertain whether it is reasonable to assume that the signal contains a 
message. 


4. The run test of Section 10.4 can also be used as a nonparametric alternative 
for testing the significance between two means. As in the Mann-Whitney 
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test, we rank the data belonging to the two samples jointly, write a 1 below 
each value belonging to the first sample, a 2 below each value belonging to 
the second sample, and then test for the randomness of this arrangement of 
1’s and 2’s. If there are too few runs, this may well be accounted for by the 
fact that the two samples came from populations with unequal means. 

(a) Apply this method to the example on page 275, which dealt with the 

grain size of two kinds of sand. 
(b) Use this method to rework Exercise 14 on page 281. 


. Making use of the fact that the median is 19.0 (see Exercise 13 on page 157), 


test the randomness of the sulfur oxides emission data on page 137 at the 
level of significance & = 0.05. 


/6) The following are the number of defective pieces turned out by a machine 


during 24 consecutive shifts: 15, 11, 17, 14, 16, 12, 19, 17, 21, 15, 17, 19, 21, 


e~ 14, 22, 16, 19, 12, 16, 14, 18, 17, 24, and 13. Test for randomness at the level 
7 } f significance & = 0.01. 


i 
f 
é Iie 
V 


The following are 50 consecutive down times of a machine (in minutes) 
which were observed during a certain period of time: 22, 29, 32, 25, 33, 34, 
38, 34, 29, 25, 27, 33, 34, 28, 39, 41, 24, 31, 34, 29, 34, 25, 30, 37, 40, 39, 35, 
24, 32, 43, 44, 34, 40, 38, 39, 43, 46, 34, 39, 45, 42, 39, 54, 50, 38, 41, 43, 46, 
52, and 55. Use the method of runs above and below the median and the level 
of significance & = 0.05 to test the null hypothesis of randomness against the 
alternative that there is a trend. 


4; Use the Kolmogorov-Smirnov test with & = 0.01 to decide whether the 
“ boiling points of Exercise 9 on page 147 can be assumed to come from a 


normal population having the mean 160°C and the standard deviation 10°C. 


yo ylaine. Use probability graph paper.] 


(of 


10. 


In a vibration study, certain airplane components were subjected to severe 
vibrations until they showed structural failures. Given the following failure 
times (in minutes), test whether they can be looked upon as a sample from an 
exponential population with the mean uw = 10: 


1.5 10.3 36 13.4 18.4 7.7 24.3 10.7 8.4 
In4 4.9 28 79 J19 120 162° 68 14,7 


Use the level of significance a = 0.05. 


Survival times (days) of 8 cancer-bearing mice that have been treated with 
a certain anticancer drug are as follows: 


16, fi; 24, 18, 35 35, 12,. 21 


Test whether these data are consistent with the assumption of a log-normal 
distribution of survival times. Use the level of significance a = 0.01. 








Curve Fitting 


11.1 THE METHOD OF LEAST SQUARES 


The main objective of many engineering investigations is to make predic- 
tions, preferably on the basis of mathematical equations. For instance, 
an engineer may wish to predict the amount of oxide that will form on 
the surface of a metal baked in an oven for a specified amount of time 
at 200°C, or the amount of deformation of a ring subjected to a compres- 
sive force of 1,000 pounds, or the time between recappings of a tire having 
a given tread thickness and composition. Usually, such predictions require 
that a formula be found which relates the dependent variable (whose 
value one wants to predict) to one or more independent variables. In this 
section we shall-consider the special case where a dependent variable is 
to be predicted in terms of a single independent variable. 
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In many problems of this kind the independent variable is observed 
without error, or with an error which is negligible when compared with the 
error (chance variation) in the dependent variable. For example, in measur- 
ing the amount of oxide on the surface of a metal specimen, the baking 
temperature can usually be controlled with good precision, but the oxide- 
thickness measurement may be subject to considerable chance variation. 
Thus, even though the independent variable may be fixed at x, repeated 
measurements of the dependent variable may lead to y-values which differ 
considerably. Such differences among y-values can be attributed to several 
causes, chiefly to errors of measurement and to the existence of other, 
uncontrolled variables which may influence the value of y when x is fixed. 
Thus, measurements of the thickness of oxide layers may vary over several 
specimens baked for the same length of time at the same temperature 
because of the difficulty in measuring thickness as well as possible differ- 
ences in the composition of the oven atmosphere, surface conditions of the 
specimens, and the like. 

It should be apparent from this discussion that in this context y is the 
value of a random variable whose distribution depends on x. In most 
situations of this sort we are interested mainly in the relationship between 
x and the mean of the corresponding distribution of y’s, and we refer to 
this relationship as the regression curve of y on x. (For the time being we 
shall assume that x is fixed, that is, not random; in Section 11.5 we shall 
consider the case where x and y are both values of random variables.) 

Let us first treat the case where the regression curve of y on x is /inear, 
that is, where, for any given x, the mean of the distribution of the y’s is 
given by w + fx. In general, an observed y will differ from this mean, and 
we shall denote this difference by e€, writing 


y=at+ pxt+e 


Thus, € is a value of arandom variable and we can always choose «@ so that 
the mean of the distribution of this random variable is equal to zero. The 
value of e for any given observation will depend on a possible error of 
measurement and on the values of variables other than x which might have 
an influence on y. 


Example. To give an example where the regression curve of y on x 
can reasonably be assumed to be linear, suppose that a tensile ring is to 
be calibrated by measuring the deflection in thousandths of an inch 
at various loads in thousands of pounds. In the following table (giving 
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the results of 12 measurements), the x’s are the load forces in thousands 
of pounds and the y’s are the corresponding deflections in thousandths 
of an inch: 


x1 1 23 4 35 6 FY & YF DW UH 


y 16 35 45 74 86 96 106 124 134 156 164 182 


It is apparent from Figure 11.1, where these data have been plotted, 
that it is reasonable to assume that the relationship (regression curve) 
is linear; that is, a straight line gives a very good approximation over 
the range of the available data. 


With regard to this example, we face the problem of using the data 
points plotted in Figure 11.1 to estimate the parameters « and f of the 
regression line, and this is equivalent to finding the equation of the 
straight line which “best fits” the data points. In this example it may well 
be satisfactory to do this “by eye,” and if different experimenters were to 








Figure 11.1. Linear regression. 
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fit a line in this way, they would probably all predict that at a loading 
of 7,500 pounds the deflection should be close to 0.115 inch. However, 
if we have to deal with data such as those plotted in Figure 11.2, the prob- 
lem of finding a best-fitting line is not so obvious. To handle problems of 
this kind, we must seek a nonsubjective method for fitting straight lines 
which reflects some desirable statistical properties. 





Figure 11.2. Least-squares criterion. 


To state the problem formally, we have n paired observations (x,, y,) 
for which it is reasonable to assume that the regression of y on x is linear, 
and we want to determine the line (that is, the equation of the line) which 
in some sense provides the “best” fit. There are several ways in which we 
can interpret the word “best,” and the meaning we shall give it here may 
be explained as follows. If we predict y by means of the equation 


y =a-+ bx 


where a and b are constants, then e;, the error in predicting the value of y 
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corresponding to the given x;, 1s 


Vi — Vi = 


Note that the equation y’ = a + bx provides an estimate of the equation 
of the regression line whose actual, but unknown, equation is y = « + fx. 
The actual error in predicting y, is €,, and this error is estimated by the 
quantity y; — y; =e; We shall attempt to determine a and 6 so that 
the estimated errors are in some sense as small as possible. 

Since we cannot minimize each of the e, individually, it suggests itself 


that we might try to make their sum 3 e, as close as possible to zero. 


i=1 
However, since this sum can be made equal to zero by many choices of 
totally unsuitable straight lines for which the positive and the negative 
errors cancel, we shall instead minimize the sum of the squares of the e; 
(see also definition of standard deviation). In other words, we shall choose 
a and 6 so that 


ps [y; — (a + bx))/ 


i= 


is a minimum. Note from Figure 11.2 that this is equivalent to minimizing 
the sum of the squares of the vertical distances from the points to the line. 
This criterion, called the criterion of least squares, yields values for a and b 
(estimates for « and f) that have many desirable properties, some of which 
will be mentioned at the end of this section. 

A necessary condition for a relative minimum is the vanishing of the 
partial derivatives with respect to a and b. We thus have 


a>) [y, — (a + bx)|(—1) =0 
23 Ly — (a+ bx)I(—x) = 0 


and rewriting these equations in a somewhat more convenient form we 
obtain the following equations, called the normal equations: 





Sy,=ant+b>d x, 
i=1 i=1 


n n n 
Dy Yi = AQ, x + bd HF 
i= i= i= 
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The normal equations are a set of two linear equations in the unknowns 
a and b; their simultaneous solution gives the values of a and b for the line 
which, thus, provides the best fit to the given data according to the 
criterion of least squares. Note that they can easily be remembered as 
follows: we first write down the equation y,=a-+ bx, and then the 
equation x;y, = ax; + bx}, obtained from the first by multiplying both 
sides by x;. If we then sum both sides of each of these equations, we get 
the two normal equations (after some easy algebraic simplifications). 


Example. To illustrate the method of least squares, as it is used to 
fit a straight line to a given set of paired data, let us apply it to the 
data of Figure 11.2, pertaining to the air velocities and evaporation 
coefficients of burning fuel droplets in an impulse engine: 


Air velocity Evaporation coefficient 
(cm/sec) (mm/sec) 

x iv 

20 0.18 

60 0.37 
100 0.35 
140 0.78 
180 0.56 
220 0.75 
260 1.18 
300 1.36 
340 1 ey 
380 1.65 


Here the x’s are the air velocities in cm/sec and the y’s are the 
evaporation coefficients in mm?/sec. Since n = 10, 


SI x, = 2,000, Sx? = 532,000 
i=1 t=1 

Sy, = 8.35, x,y, = 2,175.40 
i=1 i=1 


the normal equations are 


8.35 = 10a + 2,0005 
2,175.40 = 2,000a + 532,000D 
Solving these two equations, we obtain a = 0.069, b = 0.0038, and the 


equation of the straight line which provides the best fit in the sense 
of least squares is 


y’ = 0.069 + 0.0038x 
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This equation might be used, for example, to predict that the evapora- 
tion coefficient of droplets when the air volocity is 190 cm/sec will be 
y’ = 0.069 + (0.0038)(190) = 0.79 mm?/sec. 


It is impossible to make any exact statements about the “goodness” 
of this prediction unless we make some assumptions about the underlying 
distribution of the evaporation coefficients and about the true nature of the 
regression, namely, that for a given x the true mean of the y’s is of 
the form « + fx. Looking upon the values of a and b obtained from the 
normal equations as estimates of the regression coefficients « and B, the 
reader will be asked to show in Exercises 13 and 14 on page 306 that these 
estimates are linear in the observations y, and that they are unbiased 
estimates of « and f. With these properties, we can then refer to the 
remarkable Gauss-Markov theorem which states that among all unbiased 
estimators for & and f which are linear in the y,, the least-squares estima- 
tors have the smallest variance. In other words, the least-squares estima- 
tors are the most reliable in the sense that they are subject to the smallest 
chance variations. A proof of the Gauss-Markov theorem may be found 
in the book by H. Scheffe referred to in the Bibliography. 


11.2 INFERENCES BASED ON THE 
LEAST-SQUARES ESTIMATORS 


The method of the preceding section is used when the relationship 
between x and the mean of y is linear, or close enough to a straight line 
so that the least-squares line yields reasonably good predictions. In what 
follows we shall assume that the regression is linear and furthermore that 
the n random variables having the values y, (i = 1, .. . , m) are independent- 
ly normally distributed with the means «% + Bx; and the common variance 
o*. If we write 


yi=a+ Bx, + €; 


it follows from these assumptions that the €; are values of independent 
normally distributed random variables having zero means and the com- 
mon variance o?. The various assumptions we have made here are illus- 
trated in Figure 11.3, showing the distributions of values of y, for several 
values of the x;. Note that these additional assumptions are required to 
discuss the goodness of predictions based on least-squares equations, the 
properties of a and b as estimates of « and £, and so on; they were not 
required to obtain the original estimates based on the method of least 
squares. 

Before we state a theorem concerning the distribution of the least- 
squares estimators of « and £, it will be convenient to introduce some 
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Figure 11.3. Assumptions underlying Theorem 11.1. 


special notation. The following expressions pertaining to the sample values 
(x;, y;) occur so often that it is useful to write them as 


—L 
® 
oe 
| 
= 
x 
Ras 
= 
| 
em, 
x 
Ra 
Sacto 
Oe, 
Ms 
= 
icc cca 


Using this notation, the reader will be asked to show in Exercise 15 on 
page 306 that the solutions of the two normal equations on page 293 can 
be written as 
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where x and j are, respectively, the means of the x’s and the y’s. Note also 
the close relationship between S,, and S,, and the respective sample 
variances of the x’s and the y’s; in fact, s* = S,,/n(u — 1) and s} = 
S,,y/n(n — 1), and we shall sometimes use this alternate notation. 

The variance go? defined on page 295 is usually estimated in terms of the 
vertical deviations of the sample points from the least-squares line. The 
ith such deviation is y, — y; = y; — (a + bx;) and the estimate of a” is 





S == 


2 Ls _ 2 
e n— 5 Oe [Lyi (a + bx;)] 


where s, is, traditionally, referred to as the standard error of estimate. An 
equivalent formula for this estimate of 07, which is more convenient to use 
in actual applications, is given by 





eee =. Cr 
n(n — 2)S,.. 


sf 





In these formulas the divisor m — 2 is used to make the resulting estimator 
for a? unbiased. The “loss” of two degrees of freedom is explained by the 
fact that the two regression coefficients « and # had to be replaced by 
their least-squares estimates. It can also be shown that under the given 
assumptions (n — 2)s2/o? is a value of a random variable having the chi- 
square distribution with n — 2 degrees of freedom. 

Based on the assumptions made concerning the distribution of the 
y’s, one can prove the following theorem concerning the distributions of 
the least-squares estimators of the regression coefficients % and f: 





THEOREM 11.1. Under the assumptions given on page 295, the sam- 
pling distributions of the statistics 





(a — = 9) | Ses nS x 
Sie ue? 


p= O—D), [Sx 


and 


are t distributions with n — 2 degrees of freedom. 
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Using the first of these statistics, we can construct a confidence interval 
for #, solving the double inequality —t,,. < t < t,,..* The limits of the 
resulting confidence interval are 





wo, {Sixx (nx)? 
a ae ba /2 Se nS, 


Similar arguments lead to the following confidence limits for P: 











on 
b Ek bn/2°Se /t 





Continuation of Example. To illustrate the construction of such confi- 
dence intervals for the regression coefficients % and f, we shall assume 
that the populations from which the data on evaporation coefficients 
on page 294 were obtained satisfy the necessary assumptions of in- 
dependence, normality, and equal variances. Using the numerical re- 
sults on page 294 together with 


ST y? = 9.1097 
i=1 
we first obtain 


S.. = 10(532,000) — (2,000)? = 1,320,000 
S,y = 10(9.1097) — (8.35)? = 21.3745 
Sy = 10(2,175.40) — (2,000)(8.35) = 5,054.00 


and, hence, 


» _ (1,320,000)(21.3745) — (5,054.00)? 


(10)(8)(1, 320,000) = 0.0253 


S 


Since ¢ 925 equals 2.306 for 10 — 2 = 8 degrees of freedom, we get the 
following 0.95 confidence intervals for « and f: 


1,320,000 + (2,000)2 
0.069 ++ (2.306)(0.159) af 1 Ge 


*Although the symbol «@ is used here both for a parameter of the regression line and 
for the level of confidence, the formulas are such that there should be no ambiguity. 


299 11.2 Inferences Based on the Least-Squares Estimators 


or 
—0.164 < a < 0.302 
and 
0.0038 -E (2.306) (0.159) ,/ —~o _. 
1,320,000 
or 


0.0028 < B < 0.0048 


The results of Theorem 11.1 can also be used to establish criteria for 
testing hypotheses concerning « and f. In this connection, the parameter 
B is of special importance as it represents the s/ope of the regression line; 
that is, B gives the change in the mean of y corresponding to a unit increase 
in x. If 8 = 0, the regression line is horizontal and the mean of y does 
not depend linearly on x. 

To test the null hypothesis H,: 6 = B, we can make use of the second 
t statistic of Theorem 11.1, namely, 





p= OTF [Pas 





and the resulting critical regions are as shown in the following table: 


CRITICAL REGIONS FOR TESTING Ho: 8 = Bo 














Reject Ho if 





Alternative hypothesis 
t <— —Te 
t> le 


f < —tyyg OT F > Te/2 





where t, is obtained from Table 4 with n — 2 degrees of freedom. 


Continuation of Example. To illustrate this kind of test, let us test 
for the significance of the linear regression (linear dependence) of the 
evaporation coefficient on air velocity, basing the calculations on the 
data given on page 294. The null hypothesis to be tested is Hy: f =0 
and the appropriate alternative is H,: B 4 0. Using the calculations 
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on page 298, we obtain 


0.0038 — 0 /1,320,000 _ 
== fo ig 


and since this exceeds 2.306, the value of ¢ ,,, for 8 degrees of freedom, 
we can reject the null hypothesis at the 0.05 level of significance. In 
other words, we conclude that there is a relationship between air 
velocity and the average evaporation coefficient. 


Another problem, closely related to the problem of estimating the 
regression coefficients « and f, is that of estimating « + Bx, namely, the 
mean of the distribution of the y’s for a given value of x. If x is held fixed 
at x», the quantity we want to estimate is « + Bx, and it would seem rea- 
sonable to use a + bx,, where a and bd are again the values obtained by the 
method of least squares. In fact, it can be shown that this estimator is 
unbiased, has the variance 


off Mie 


and that a 1 — a confidence interval for % + Bx, is given by 





(a => Dog) Se tape 8, ae + main — BY 


where ¢,. is to be obtained from Table 4 with n — 2 degrees of freedom. 

Of even greater importance than the estimation of w + fx, is usually 
the prediction of y’ (a future value of y) when x = x. Thus, for our 
example we already showed on page 295 that for an air velocity of 190 
cm/sec (a value not included in the experiment), the predicted evaporation 
coefficient is 0.79 mm2/sec. Now let us indicate a method of constructing 
an interval in which y can be expected to lie with a given probability when 
x = X,. If « and # were known, we could use the fact that y is a random 
variable having a normal distribution with the mean % + fx, and the 
variance a? (or that y — a — Px, is a value of a random variable having 
a normal distribution with zero mean and the variance a”). However, if 
a and # are not known, we must consider the quantity y — a — bx), 
where y, a, and 5 are all values of random variables, and the resulting 
theory leads to the following limits of prediction for y when x = x): 
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» — x)? 


(a + Bx) ta“ Sea/ I jt 4 Me 


where the number of degrees of freedom for t,,. is again n — 2. 


Continuation of Example. With reference to the air velocity and 
evaporation-coefficient data, let us first illustrate the construction 
of confidence limits for a + bx, when x, = 190. Substituting the 
various quantities already calculated into the formula on page 300, 
we can write 0.95 confidence limits for ~ + 190 as 


‘1 (0)\1190 — 200)2 
0.79 + (2.306)(0.159),/. 4 UCP — 200)" 


and, hence, 
0.67 <a-+ 1908 < 0.91 


Correspondingly, 0.95 limits of prediction for a future value of y 
when x = 190 are given by 


77101190 = 20072 
0.79 + (2.306)(0.159)4/1 + zr + UOTE — 00) 


or 0.79 + 0.39. 


Note that although the mean of the distribution of y’s when x = 190 
can be estimated fairly closely, the value of a single future observation 
cannot be predicted with good precision. Observe from the formula for 
the limits of prediction that even as m — oo the difference between the 
limits of prediction does not approach zero. The limiting width of the 
interval of prediction depends on s,, which expresses the inherent vari- 
ability of the data. Note, further, that if we wish to extrapolate, that is, 
to predict a future value of y corresponding to a value of x outside the 
range of the observations, the limits of prediction (and also the confidence 
limits for w + Bx) become increasingly wide. 


Continuation of Example. For instance, 0.95 limits of prediction for 
y corresponding to x = 450 cm/sec are given by 


, , 1, 10(450 — 200)? 
» 178 (2.306)(0.159) q/1 + a a Oo 
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or 1.78 + 0.46. In other words, we can assert with a probability of 
0.95 that the evaporation coefficient corresponding to an air velocity 
of 450 cm/sec will fall between 1.32 and 2.24 mm?2/sec. Poor as it is, 
this prediction is also based on the assumption that the true regression 
continues to be linear when we go beyond the range of the observations. 


EXERCISES 


if 


i A chemical company, wishing to study the effect of extraction time on the 
efficiency of an extraction operation, obtained the data shown in the fol- 
lowing table: 


Extraction time Extraction efficiency 
(minutes) (percent ) 
x Rd 
Zt 57 
45 64 
4] 80 
19 46 
35 62 
39 72 
19 52 
49 ri 
15 aT 
31 68 


(a) Assuming the regression of y on x to be of the form y = &@ + Bx, find 
the normal equations for estimating the parameters & and B and solve 
for a and DB. 

(b) Graph the data together with the straight line obtained in part (a), and 
use it to read off the extraction efficiency one can expect when the extrac- 

/ tion time is 35 minutes. 


f- Solving the normal equations on page 293 symbolically, show that 
gq a (VY) = Oi 2) XY) 
n> x?) — (Dy x)? 
b — MOY xy) — (i OO: Y) 
n(>; x7) — (, x)? 


(The indices and limits of summation have been omitted for simplicity.) 
Use these formulas to verify the results obtained in part (a) of Exercise 1. 
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Jat the accompanying table, x is the tensile force applied to a steel specimen 
in thousands of pounds and y is the resulting elongation in thousandths of 
an inch. Assuming the regression of y on x to be linear, estimate the param- 
eters of the regression line and construct a 0.95 confidence interval for B, 
the elongation of the specimen per thousand pounds of tensile stress. 


x 1 


y 14 33 


3 4 


40 63 


5 6 


76 85 


ra following data pertain to the demand for a product (in thousands of 
units) and its price (in cents) charged in seven different market areas: 


Price 


x 


11 

9 
12 
10 
15 
12 

6 


Dema 
Y 


145 
177 
109 
ios 

81 
118 
218 


nd 


(a) Estimate & and P, the coefficients of the regression line of y on x. 
(b) Test the null hypothesis B = —20 at a level of significance of 0.01. 


5. The following show the improvement (gain in reading speed) of eight stu- 
dents in a speed-reading program, and the number of weeks they have been 


in the program: 


Number of 
weeks 


RW OW ON WN WD 







fidence interval for f. 


Spe 


ed gain 


(words per minute) 


86 
118 
49 
193 
164 
22 
73 
109 


Fyt a straight line to these data by the method of least squares, using number 
f weeks in the program as the independent variable. Also, find a 0.90 con- 


6. Raw material used in the production of a synthetic fiber is stored in a place 


which has no humidity control. Measurements of the relative humidity in the 
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storage place and the moisture content of a sample of the raw material (both 
in percentages) on 12 days yielded the following results: 


(a) 


(b 


) 


(Cc) 


(d) 


x y 
Humidity Moisture content 
42 12 
35 8 
50 14 
43 9 
48 11 
62 16 
31 7 
36 9 
44 2 
39 10 
Sis 13 
48 11 


Estimate & and f, the coefficients of the regression line of y on x. 

Find a 0.99 confidence interval for the mean moisture content of the raw 
material when the humidity of the storage place is 40 percent. 

Find 0.95 limits of prediction for the moisture content of the raw 
material when the humidity of the storage place is 40 percent. 
Referring to part (c), indicate to what extent the width of the interval is 


, affected by the size of the sample and to what extent it is affected by the 


f 


inherent variability of the data. 


af Referring to the data of Exercise 3: 


(a) Find a 0.95 confidence interval for the mean of the distribution of the 


(b 


) 


y’s when x = 3.5. 

Express the 0.95 confidence limits for the mean of the distribution of the 
y’s in terms of x); plot a graph of the estimated regression line and, 
choosing suitable values of x, sketch graphs of the loci of the upper and 
lower confidence limits on the same set of coordinate axes. (The resulting 
“confidence bands” do not give confidence intervals for the line itself; in 
fact, since any two confidence intervals obtained from these bands are 
dependent, they should be used only once in connection with some fixed 
value Xo.) 


. Referring to Exercise 1, express 0.95 limits of prediction for the extraction 
efficiency in terms of the extraction time x). Choosing suitable values of xo, 
sketch graphs of the loci of the upper and lower limits of prediction on the 
same set of coordinate axes. (Note that, as in Exercise 7, such bands should 
be used only once in connection with some given extraction time.) 


9, In Exercises 3 and 5 it would have been entirely reasonable to impose the 
condition & = 0 before fitting a straight line by the method of least squares. 
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(a) Using the method of least squares, derive a formula for estimating B 
when the regression line is assumed to have the form y = Px. 

(b) Using the data of Exercise 3 and the formula obtained in part (a), esti- 
mate # and compare the result with the estimate previously obtained 
without the condition that the line must pass through the origin. 

(c) Using the data of Exercise 5 and the formula obtained in part (a), esti- 
mate £ and compare the result with the estimate previously obtained 

. without the condition that the line must pass through the origin. 


~ The cost of manufacturing a lot of a certain product depends upon the lot 
size as shown in the following table: 


Cost (dollars)| 30 70 140 270 530 1,010 2,500 5,020 





Lot size 1 5 10 2 50 100 250 500 


Fit a straight line to these data by the method of least squares, using lot size 
as the independent variable. Also, find a 0.90 confidence interval for 4, 
which can here be interpreted as the fixed overhead cost of manufacturing. 


11. Referring to the data of Exercise 10, fit a straight line using the cost as the 
independent variable. Graph the resulting line together with the one found 
ip’Exercise 10 in one diagram and note that the two regression lines do not 
coincide. 

When the sum of the x-values is equal to zero, the calculation of the coeffi- 


cients of the regression line of y on x is greatly simplified; in fact, their esti- 
mates are given by 


a 


J XY 
a=%2 and b= oy 
This simplification can also be attained when the x’s are equally spaced, that 
is, when they are in arithmetic progression. We then “code” the data by 
substituting for the x’s the values..., —2, —1,0,1,2,..., whenz is odd, 
or the values..., —3, —1,1,3,..., whenz is even. The above formulas are 
then used in connection with the coded data. 
(a) Use this technique to fit a least-squares trend line to the following data 
on the income from tourism in Arizona (in millions of dollars): 


Income from 


Year tourism 
1965 420 
1966 450 
1967 480 
1968 500 
1969 530 
1970 565 
1971 600 
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Also use the result to predict Arizona’s income from tourism in the year 
1978, and calculate 0.90 limits of prediction for this estimate. 

(b) Use this technique to fit a least-squares trend line to the following data 
on a company’s annual profits in millions of dollars: 


Year Profit 
1970 a, 
197] 6.1 
1972 8.0 
1973 8.9 
1974 10.1 
1975 10.7 


Also use the result to predict the company’s profit in 1980, and calculate 
0.80 limits of prediction. 

(c) What assumption are you making in connection with your predictions 
of tourism in 1978 and profits in 1980? 


13. Using the formulas obtained in Exercise 2, show that 
(a) the expression for a is linear in the y,; 
(b) ais an unbiased estimate of &. 


14. Using the formulas obtained in Exercise 2, show that 
(a) the expression for 5 is linear in the y;; 
(b) b is an unbiased estimate of f. 


15. Show that the least-squares estimates of the coefficients of the regression 
line of y on x can be written in the form 


a=y—b-x and b=—> 





{1.3 CURVILINEAR REGRESSION 


So far we have studied only the case where the regression curve of 
y on x is linear; that is, where for any given x, the mean of the distribution 
of the y’s is given by a + fx. In this section, we shall first investigate 
cases where the regression curve is nonlinear, but where the methods 
of Section 11.1 can nevertheless be applied; then we shall take up the 
problem of polynomial regression, that is, problems where for any 
given x the mean of the distribution of the y’s is given by By + Bix+ 
Bix? +... + B,x?. This last technique is also used to obtain approxi- 
mations when the functional form of the regression curve is unknown. 

It is common practice for engineers to plot paired data on various 
kinds of graph paper, in order to determine whether for suitably trans- 
formed scales the points will fall close to a straight line. If that is the case, 
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the nature of the transformation used leads to a functional form of the 
regression equation, and the necessary constants (parameters) can be 
determined by applying the method of Section 11.1 to the transformed 
data. If a set of paired data consisting of n points (x,, y;) “straightens out” 
when plotted on semi-log paper, for instance, this indicates that the 
regression curve of y on x is exponential, namely, that for any given x, 
the mean of the distribution of the y’s is given by a«- 6%. Taking logarithms 
to the base 10 of both sides of the predicting equation* 


Val = a+ B* 
we obtain 
log y’ = loga + x-log Bp 


and we can now get estimates of log « and log #, and hence of a and P, 
by applying the method of Section 11.1 to the 7 pairs of values (x;, log y,). 


Example. To illustrate the least-squares fitting of an exponential 
curve, let us consider the following data on the percentage of the 
radial tires made by a certain manufacturer that are still usable after 
having been driven for the given number of miles: 


Miles driven Percentage 
(thousands) usable 
x y 
1 98.2 
2 91.7 
5 81.3 
10 64.0 
20 36.4 
30 32.6 
40 17.1 
50 113 


When plotting the data on semi-log graph paper as in Figure 11.4, we 
observe that the points fall fairly close to a straight line and that it 
is, therefore, justifiable to fit an exponential curve. Determining first 
the logarithms of the eight y’s, we obtain, respectively, 1.9921, 1.9624, 
1.9101, 1.8062, 1.5611, 1.5132, 1.2330, 1.0531, and the summations 
required for substitution into the normal equations are >) x = 158, 
>) x? = 5,530, }) log y = 13.0312, and >) x-log y = 212.1224, where 
the subscripts and limits of summation have been omitted for sim- 
plicity. Again using a and 6 for the least-squares estimates of a and ~, 


* Any other base could have been used. 
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Figure 11.4. Exponential regression. 


we obtain the normal equations 


13.0312 = 8(log a) + 158(log b) 
212.1224 = 158(log a) + 5,530(log b) 


and, solving this system, we get 
loga = 2.0002 and log b = —0.0188 


Hence, a = 100.0, b = 0.96, and the exponential curve fit by the meth- 
od of least squares has the equation 


y’ = 100(0.96)* 


This equation can also be written as y’ = 100-10~°°!*®* or YY = 
100-e-°-9433*, making use of the fact that 10~°-°18§ = e~ 9 °433, If we 
want to base a prediction on an exponential equation fitted to a set 
of data, it is usually more convenient to use it in its logarithmic form. 
For instance, if we want to predict what percentage of the tires will 
last at least 25,000 miles, we get 


log y = 2.0002 + 25(—0.0188) 
a+ 15302 


and, hence, 33.9 percent. 
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Two other relationships, frequently arising in engineering practice, 
that can be fitted by the method of Section 11.1 after suitable transfor- 


mations are the reciprocal function given by y’ = , and the power 


| 
Ko px 
function y' = «4-x*. The first of these represents a linear relationship 
between x and 1/y’, namely, 


] 
=F = S47 

JF UA 

and we obtain estimates of w and f by applying the method of Section 11.1 
to the points (x,, 1/y,). The second represents a linear relationship between 
log x and log y’, namely, 


logy’ = loga + B-log x 


and we obtain estimates of log « and #, and hence of « and £, by applying 
the method of Section 11.1 to the points (log x,, log y;. Examples of 
other curves that can be fitted by the method of least squares after a suit- 
able transformation are given in Exercises 6 and 7 on page 318. 

If there is no clear indication about the functional form of the regres- 
sion of y on x, we often assume that the underlying relationship is at least 
“well behaved” to the extent that it has a Taylor series expansion and that 
the first few terms of this expansion will yield a fairly good approxima- 
tion. We thus fit to our data a polynomial, that is, a predicting equation 
of the form . 


Y= Pot Bit + fet cet pe 


where the degree is determined by inspection of the data or by more 
“scientific” methods to be discussed below. 

Given a set of data consisting of n points (x;, y,), we estimate the 
coefficients By, B,, B.,..., B, of the pth-degree polynomial by minimiz- 
ing 


p3 [v2 — (Bo + Bix, + Box? +... + Bx?)]? 


In other words, we are now applying the least-squares criterion by mini- 
mizing the sum of the squares of the vertical distances from the points 
to the curve (see Figure 11.5). Differentiating partially with respect to 
Bo, Bi, Ba, -. +, Bp, equating these partial derivatives to zero, rearranging 
some of the terms, and letting b; be the estimate of £,, we obtain the 
p + 1 normal equations 
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Soe by Yk De ee, De 





where the indices and limits of summation are omitted for simplicity. 
Note that this is a system of p + 1 Jinear equations in the p + | unknowns, 
by, b;,...,6,. Unless the choice of the x-values is very unusual, this 
system of equations will have a unique solution. 





Figure 11.6. Polynomial regression curve. 
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pet To illustrate the fitting of a polynomial by the method of 
least squares, we shall fit a quadratic (second-degree) polynomial to 


the following data, relating the drying time of a varnish to the amount 
of a certain additive: 


x y 
Amount of varnish Drying time 
additive (grams) (hours) 
0 12.0 
] 10.5 
2 10.0 
3 8.0 
4 7.0 
5 8.0 
6 7.5 
7 8.5 
8 9.0 


Inspection of these data plotted in Figure 11.6 indicates that a second- 
degree polynomial, having one relative minimum, should yield a fairly 
good fit. First calculating the sums required for substitution into the 
normal equations, we obtain 
> x = 36, >. x? = 204, yx? = 1296, Yi xt = 8772 
> 7 = 80.5, > xy = 299.0, > ry = 16970 
and we thus have to solve the following system of three linear equations 
in three unknowns: 
80.5 = 9b, + 366, + 2046, 
299.0 = 36b, + 2046, + 12966, 
1697.0 = 2045, + 12966, + 87726, 


Getting b) = 12.2, b, = —1.85, and b, = 0.183, we find that the 
equation of the least-squares polynomial is 


y’ = 12.2 — 1.85x + 0.183x? 


The graph of this equation is also shown in Figure 11.6. 

Having obtained this equation, we might now use it to predict 
the drying time when the amount of additive used is 6.5 grams. 
Substituting x = 6.5 into the equation, we obtain 


y = 12.2 — 1.85(6.5) + 0.183(6.5)? = 7.9 


that is, a predicted drying time of approximately 7.9 hours. Note that 
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it would have been rather dangerous to use the above equation to 
predict, say, the drying time which corresponds to 24.5 grams of 
additive. The risks inherent in extrapolation discussed on page 301, 
in connection with fitting straight lines, are greatly increased when 
polynomials are used to approximate unknown regression functions. 


In actual practice, it may be difficult to determine the degree of the 
polynomial to fit to a given set of paired data. As it is always possible 
to find a polynomial of degree at most n — 1 that will pass through each 
of n points corresponding to n distinct values of x, it should be clear that 
what we actually seek is a polynomial of Jowest possible degree that “ade- 
quately” describes the data. As we did in our example, it is often possible 
to determine the degree of the polynomial by inspection of the data. 

There also exists a more rigorous method for determining the degree 
of the polynomial to be fitted to a given set of data. Essentially, it con- 
sists of first fitting a straight line as well as a second-degree polynomial 
and testing the null hypothesis 8, = 0, namely, that nothing is gained 
by including the quadratic term. If this null hypothesis can be rejected, 
we then fit a third-degree polynomial and test the hypothesis B, = 0, 
namely, that nothing is gained by including the cubic term. This procedure 
is continued until the null hypothesis 8; = 0 cannot be rejected in two 
successive steps and there is, thus, no apparent advantage to carrying the 
extra terms. Note that in order to perform these tests it is necessary to 
impose the assumptions of normality, independence, and equal variances 
introduced in Section 11.2. Also, these tests should never be used “blindly,” 
that is, without inspection of the overall pattern of the data. 

The use of this technique is fairly tedious and we shall not illustrate 
it in the text. In Exercise 11 on page 319 the reader will be asked to apply 
it to the varnish-additive, drying-time data in order to check whether it 
is worthwhile to carry the quadratic term. (If the x values are equally 
spaced, it is possible to simplify the necessary calculations with the use 
of orthogonal polynomials, as is explained in Volume 2 of the book by 
M. G. Kendall and A. Stuart referred to in the Bibliography.) 





1.4 MULTIPLE REGRESSION 


Before we extend the methods of the preceding sections to problems 
involving more than one independent variable, let us point out that the 
curves obtained (and the surfaces we will obtain) are not used only to 
make predictions. They are often used also for purposes of optimization, 
namely, to determine for what values of the independent variable (or 
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variables) the dependent variable is a maximum or minimum. For instance, 
in the example on page 311 we might use the polynomial fitted to the data 
to conclude that the drying time is a minimum when the amount of 
varnish additive used is 5.1 grams (see Exercise 12 on page 320). 

Statistical methods of prediction and optimization are often referred 
to under the general heading of response surface analysis. Within the 
scope of this text, we shall be able to introduce two further methods of 
response surface analysis: multiple regression in this section and related 
problems of factorial experimentation in Chapter 13. 

In multiple regression, we deal with data consisting of n (r + 1)-tuples 
(X13 Xo; ++» Xp. Vi), Where the x’s are again assumed to be known with- 
out error while the y’s are values of random variables. Data of this kind 
arise, for example, in studies designed to determine the effect of various 
climatic conditions on a metal’s resistance to corrosion; the effect of kiln 
temperature, humidity, and iron content on the strength of a ceramic 
coating; or the effect of factory production, consumption level, and stocks 
in storage on the price of a product. 

As in the case of one independent variable, we shall first treat the prob- 
lem where the regression equation is linear, namely, where for any given 
set of values x,, x,,..., and x, the mean of the distribution of the y’s is 
given by 


Pat Pim Poke ins P £ ey, 


For two independent variables, this is the problem of fitting a plane to 
a set of m points with coordinates (x,;, X2,, y,;) as is illustrated in Figure 
11.7. Applying the method of least squares to obtain estimates of the 
coefficients Bo, B,, and B,, we minimize the sum of the squares of the 
vertical distances from the points to the plane (see Figure 11.7); symboli- 
cally, we minimize 


p2 [yi — (Bo + Bix: + B2%2,))? 


and it will be left to the reader to verify in Exercise 13 on page 320 that 
the resulting normal equations are 


Le = nbe + by Se Os 
DY Oy Ddy Dy DAT Oe Dy phe 


Di Hay = 0g Da Oy Dn OS 
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As before, we write the least-square estimates of By, B,, and B, as dg, b,, 


and b,. Note that in the abbreviated notation used >) x, stands for $7 x;;, 
i=l 


¥ x,x, stands for }) x4;X2; >) x,y stands for >) x,,y,, and so forth. 
i=1 i=1 


Example. To illustrate this method of fitting a plane to a set of data, 
let us consider the following data relating the number of twists required 
to break a forged alloy bar to the percentage of each of two alloying 
elements present in a metal: 


R X14 x2 
Number of Percent of Percent of 
twists element A element B 
38 1 5 
40 2 5 
85 3 5 
59 4 5 
40 1 10 
60 Z 10 
68 3 10 
53 4 10 
31 1 15 
35 2 15 
42 3 15 
59 4 Ls 
18 1 20 
34 2 20 
29 3 20 
42 4 20 


Substituting 

Xx, = 40, Yix,=200, Sxt?=120, Yx,x, = 500 
Sax = 3,000, J pee Ts, > xy = 1,982, >) Xo = 8,285 
into the normal equations, we obtain 


733 = 16b, + 40b, + 200d, 
1,989 = 40b, + 1205, + 500d, 
8.285 = 200b, + 500b, + 3,000D, 


The unique solution of this system of equations is b) = 48.2, b, = 7.83, 
b, = —1.76, and we can thus estimate (predict) the number of twists 
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Figure 11.7. Regression plane. 


required to break one of these bars by means of the equation 
y’ = 48.2 + 7.83x, — 1.76x, 


for any given pair of values of x, and x,. Note that b, and b, are 
estimates of the average change in y resulting from a unit increase in 


the corresponding independent variable when the other independent 
variable is held fixed. 


When a regression problem involves many variables and (or) the fitting 
of polynomials, the necessary calculations can become quite tedious. 
However, if the values of the independent variables can be controlled, it is 
possible to attain considerable simplifications by using appropriate spac- 
ings and subsequent coding (similar to that used in Exercise 12 on page 
305). 


Continuation of Example. Observe that in our example the x,’s as 
well as the x,”s are equally spaced and that, furthermore, each value 
(level) of x, occurs once in combination with each value (level) of x,. 
This arrangement makes it possible to reduce the labor of obtaining 
and solving the normal equations. Coding the x, values as well as the 
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x, values —3, —1, 1, and 3,* and calling them z, and z,, we can 
write the original data as 


y Z1 22 
38 —3 —3 
40 —] —3 
85 1 —3 
59 3 —3 
40 —3 —] 
60 —] —] 
68 1 —] 
53 3 —] 
31 —3 1 
35 —1 1 
42 1 1 
59 3 1 
18 —3 3 
34 —] 3 
29 1 3 
42 3 3 


Wenow cet >) 2, = GO, 3 25 = 0, >, 27 = 80, D size = 0, 3 = 8, 
Sy = 733, > 2,y = 313, > z.y = —351, and the normal equations 
become 


733 = 16b, 
313 = 808, 
—351 = 808, 


Thus, in terms of these coded variables, we have 
y’ = 45.8 + 3.91z, — 4.392, 


and the reader will be asked to verify in Exercise 16 on page 321 that 
(except for rounding errors) this result is equivalent to the one pre- 
viously obtained. 


The same coding would have produced even greater simplifications if 
we had wanted to fit a second-degree polynomial to our data, that is, 
if we had used an equation of the form 


vy = Bot Bir + Box. + B3xXyX2 + Baxi + B 5x5 


*Had there been five values of x1, we would have coded them —2, —1, 0, 1, and 2. 
In this kind of coding the values must always be equally spaced and their sum must 
equal zero. 
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Without coding, the method of least squares would have led to a system 
of normal equations consisting of six simultaneous linear equations in six 
unknowns. As the reader will be asked to verify in Exercise 17 on page 321, 
the same coding as used above reduces this problem, essentially, to that 
of solving three simultaneous linear equations in three unknowns. 

The example of this section served to illustrate how a careful choice 
of the values of the independent variables can simplify the calculations 
required in a problem of multiple regression. The principles and proce- 
dures for making this choice, whenever possible, are part of the important 
subject of experimental design. In Chapter 12 we shall consider some of 
the simpler, but nevertheless widely used, experimental designs. We shall 
focus our attention on what is called the analysis of variance, where tests 
are made concerning the significance of the effects of certain combina- 
tions of levels of the independent variables. In Chapter 13 we shall intro- 
duce the related problem of factorial experimentation. 


EXERCISES 


est aie following data pertain to the chlorine residual in a swimming pool at 
various times after it has been treated with chemicals: 


Number of hours Chlorine residual 
(parts per million) 
x y 
12 1.80 
24 1.47 
36 1.25 
48 1.10 
60 1.01 
Use the method of least squares to fit an exponential curve of the form 
y = a B 
“The following data pertain to the growth of a colony of bacteria in a culture 
medium: 
Days since Bacteria 
inoculation count 
x y 
3 115,000 
6 147,000 
9 239,000 
12 356,000 
15 579,000 


18 864,000 
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(a) Plot the data on semi-log graph paper and use the method of least 
squares to fit an exponential curve of the form y = a PB». 
) Use the result obtained in part (a) to estimate the bacteria count at the 
end of the first day. 


3. The following data pertain to the cosmic ray doses measured at various 


altitudes: 
Altitude Dose rate 

(feet) (mrem|year ) 

x JY 

50 28 

450 30 

780 32 

1,200 36 

4,400 eal 

4,800 58 

5,300 69 


Usé the method of least squares to fit an exponential curve of the form 
= &-eF*, 


. Use the method of least squares to fit a curve of the form y = &- x to the 
price-demand data of Exercise 4 on page 303. 


5. In an experiment designed to determine the specific heat ratio y for a certain 
gas, the gas was compressed adiabatically to several predetermined volumes 
V, and the corresponding pressure p was measured with the following 
results: 


p(ibjin®) | 16.6 39.7 78.5 115.5 195.3 546.1 
V (in3) 50 30 20 15 10 5 


Assuming the ideal gas law p: V” = C, use these data to 

(a) estimate y for this gas; 

)) construct a 0.95 confidence interval for y. (State what assumptions will 
have to be made.) 





Fit a Gompertz curve of the form 


<—Or 


etxtB 


y=e 
to the data of Exercise 1 and compare the fit with that of the exponential 
curve obtained in Exercise 1. 


7. The rise of current in an inductive circuit having the time constant T is given 
by 


fa =e 
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where ¢ is the time measured from the instant the switch is closed, and J is 
the ratio of the current at time ¢ to the full value of the current given by 
Ohm’s law. Given the measurements 


I 0.073 0.220 0.301 0.370 0.418 0.467 0.517 0.578 





t (sec) 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 


estimate the time constant of this circuit from the experimental results given. 


Pai : See Exercise 9 on page 304.] 
*8. The number of inches which a newly-built structure is settling into the 
ground is given by 
yo3—3e 
where x is its age in months. 
x 2 4 6 12 18 24 


y 1.07 1.88 220 2,48 Zoot 299 


Use the method of least squares to estimate &. 


a 

he following data pertain to the amount of hydrogen present (y, in parts 
per million) in core drillings made at one-foot intervals along the length of a 
vacuum-cast ingot (x, core location in feet from base): 





y 1.28 1.53 1.03 0.81 0.74 0.65 0.87 0.81 1.10 1.03 


Fit a parabola by the method of least squares. 


10. Suitably coding the years, fit a parabola to the following data on the pro- 
duction of television sets (in thousands) in Japan for the years 1963 through 
1969: 4,916, 5,273, 4,190, 5,652, 6,963, 9,001, and 12,118. 

lve fitting a polynomial to a set of paired data, we usually begin by fitting 
a straight line and using the method on page 299 to test the null hypothesis 
B, = 0. Then we fit a second-degree polynomial and test whether it is worth- 
while to carry the quadratic term by comparing 67, the residual variance 
after fitting the straight line, with @2, the residual variance after fitting the 
second-degree polynomial. Each of these residual variances is given by the 
formula 

po G dee 


degrees of freedom 


with y’ computed, respectively, from the equation of the line and the equa- 
tion of the second-degree polynomial. The decision whether to carry the 
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quadratic term is based on the statistic 


A2 A2 
Ps 0; 02 


2 
2 


Q 


which (under the assumptions of Section 11.2) is a value of a random variable 

having the F distribution with 1 and n — 3 degrees of freedom. 

(a) Fit a straight line to the varnish-additive and drying-time data on page 
311, test the null hypothesis B; = 0 at the 0.05 level of significance, and 
calculate 6?. 

(b) Using the result on page 311 calculate @3 for the given data and test at 
the 0.05 level whether we should carry the quadratic term. (Note that we 
could continue this procedure and test whether to carry a cubic term by 
means of a corresponding comparison of residual variances. Then we 
could test whether to carry a fourth-degree term, and so on. It is cus- 
tomary to terminate this procedure after two successive steps have not 
produced significant results.) 


2 
cd 


12. Referring to the example on page 311, verify that the drying time is a mini- 
mum when the amount of varnish additive used is 5.1 grams. 


13. Verify that the method of least squares leads to the system of normal equa- 
ftions on page 313. 


(A. Twelve specimens of cold-reduced sheet steel, having different copper con- 
~ tents and annealing temperatures, are measured for hardness with the fol- 
lowing results: 


Hardness Copper content Annealing temperature 
(Rockwell 30-T) (percent ) (degrees F) 
78.9 0.02 1000 
65.1 0.02 1100 
i i 0.02 1200 
56.4 0.02 1300 
80.9 0.10 1000 
69.7 0.10 1100 
57.4 0.10 1200 
55.4 0.10 1300 
85.3 0.18 1000 
71.8 0.18 1100 
60.7 0.18 1200 
58.9 0.18 1300 


Fit an equation of the form y = Bo + Bix: + B.x2, where x, represents 
the copper content, x, represents the annealing temperature, and y repre- 
sents the hardness. 


5. The following are data on the ages and incomes of five executives working 


f for the same company, and the number of years they went to college: 


i 
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Income 
Age Years college (dollars) 
x1 X2, ¥ 
37 4 31,200 
45 0 26,800 
38 5 35,000 
42 ya 30,300 
31 4 25,400 


Fit an equation of the form y = bp + b,x; + 62x, to the given data, and 
use it to estimate how much on the average an executive working for this 
company will make if he is 40 years old and has had 4 years of college. 


a Referring to the example involving the number of twists required to break a 
forged bar, verify that the result given on page 316, obtained by coding, is 
equivalent to the original result shown on page 315. 


17. Verify that if the x’s are coded as on page 316, the normal equations for 
estimating the coefficients of the second-degree polynomial 


y= Bo = Bix, a B x2 - BaxiXe = B4x? a B 3x3 


become 
733 = 16 + 8064, + 8055 
315 = 806, 
351 = 805, 
—19 = 4005; 
3453 = 805, + 656b, + 4006; 
3493 = 804) + 40064 + 65655 
18. Code the values of x, in Exercise 14 as z} = —1, 0, +1, and the values of 
X, aS Z, = —3, —1, +1, +3. Then write down the resulting normal equa- 


tions, solve, and show that the resulting regression equation is equivalent to 
the one obtained in Exercise 14. 


11.5 CORRELATION 


So far in this chapter, we have studied problems where the indepen- 
dent variable (or variables) was assumed to be known without error. 
Although this applies to many experimental situations, there are also 
problems where the x’s as well as the y’s are values assumed by random 
variables. This would be the case, for instance, if we studied the relation- 





322 Ch. 11 Curve Fitting 


ship between rainfall and the yield of a certain crop, the relationship 
between the tensile strength and the hardness of aluminum, or the relation- 
ship between impurities in the air and the incidence of a certain disease. 
Problems like these are referred to as problems of correlation analysis, 
where it is assumed that the data points (x;, y,) for i= 1, 2,...,n are 
values of a pair of random variables whose joint density is given by 
F(x, y). 

The bivariate density which is most commonly used in problems of 
correlation analysis is the bivariate normal distribution. We shall introduce 
it here in terms of the conditional density g,(y|x) and the marginal density 
f:(x), as defined in Section 4.10. So far as g,(y|x) is concerned, the 
conditions we shall impose are practically identical with the ones we used 
in connection with the sampling theory of Section 11.2. For any given x, 
it will be assumed that g,(y|x) is a normal distribution with the mean 
a& + fx and the variance o?. Thus, the regression of y on x is linear and the 
variance of the conditional density does not depend on x. Furthermore, 
we shall assume that the marginal density f,(x) is normal with the mean 
H#, and the variance oj. Making use of the relationship f(x, y) = 
F(x): 22(y |x) given on page 129, we thus obtain 








| | 
— =(x=p4)*/Zo%* —[y— (a + Bx))]?/20? 
x,y) = —=—e - ae 
F(*y) af 206 »/ 210 
= I ely (a + Bx)1?/20? + (x —41)?/2017} 
2°07; 


for —co < x < oo and —oo < y< oo, Note that this joint distribution 
involves the five parameters “,,0,, 4, B, anda. 

For reasons of symmetry and other considerations to be explained 
below, it is customary to express the bivariate normal density in terms of 
the parameters 4;,0;, 2,0 , and p. Here uw, and o3 are the mean and the 
variance of the marginal distribution /,(y), while p, called the correlation 
coefficient, is defined by 


bdo 


lc oe 


2/9 
Nd 


with p taken to be positive when f is positive (and negative when f is 
negative). Leaving it to the reader to show in Exercise 4 on page 329 that 


H,=a+ Bu, and of =a’ + fo? 


we then substitute into the above expression for f(x, y) and finally obtain 


323 11.5 Correlation 


the following form of the bivariate normal distribution: 


ne SI | (2-1) ery (oaet_2) | Cae) Pe Cae) TC 
I(x, y) ~~ 2n*G,0,/1 — p2~ o1 o1 o2 C2 


for —co <x < oo and —oo < y < oo (see Exercise 5 on page 329). 

Concerning the correlation coefficient p, note that -—l<p< +1 
since 3 = ao? + fa? and, hence, o3 > o*. Furthermore, p can equal 
—1 or +1 only when o? = 0, which represents the degenerate case where 
all the probability is concentrated along the line y = a + fx and there is, 
thus, a perfect linear relationship between the two random variables. 
(That is, for a given value of x, y must equal « + fx.) The correlation 
coefficient is equal to zero if and only if 0% = o%, and it follows from the 
identity 03 = o* + f?o% that this can happen only when # = 0. Thus, 
p = 0 implies that the regression line of y on x is a horizontal line and, 
hence, that knowledge of x does not help in the prediction of y. Thus, when 
p = +1 we say that there is a perfect linear correlation (relationship, 
or association) between the two random variables; when p = 0 we say 
that there is no correlation (relationship, or association) between the two 
random variables. In fact, p = 0 implies for the bivariate normal density 
that the two random variables are independent (see Exercise 6 on page 329). 

For values between 0 and +1 or 0 and —1, we interpret p by referring 
back to the identity 


1 o> o%—o? 
~— 2 
02 02 


p= 


on page 322. Since ao” is a measure of the variation of the y’s when x is 
known while a3 is a measure of the variation of the y’s when x is unknown, 
o% — o” measures the variation of the y’s that is accounted for by the 
linear relationship with x. Hence, p? tells us what proportion of the varia- 
tion of the y’s can be attributed to the linear relationship with x. 

Given a random sample of size n—that is, n pairs of values (x;, y,;)—it 
is customary to estimate p by means of the sample correlation coefficient 
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where S,,, S 


xy? xx? 


and S,, are as defined on page 296. This estimator is not 


unbiased and, except for the factor rT ~ —) 


stituting for ¢3 the sample variance of the y’s and for o” the estimator on 
page 297, namely, s? (see also Exercise 7 on page 329). Note that the sam- 
ple correlation coefficient r is often used to measure the strength of a linear 
relationship exhibited by sample data even if the data do not necessarily 
come from a bivariate normal population. 


it may be obtained by sub- 





Example. To illustrate the calculation of r, consider the following 
paired observations of the number of minutes it takes 10 mechanics to 
assemble a piece of machinery in the morning, x, and in the late 
afternoon, y: 


x ss 
Li! 10.9 
10.3 14.2 
12.0 13.8 
a a 
13.7 13.2 
18.5 ZAvd 
17.3 16.4 
14.2 19,3 
14.8 17.4 
13.3 19.0 


The calculations required to find r are as follows, where the subscripts 
and limits of summation are omitted for the sake of simplicity. First 
we determine the necessary sums, getting }) x = 142.3, } y = 166.8, 
> x? = 2,085.31, >) xy = 2,434.69, > y? = 2,897.80, and then, 
substituting into the formulas on page 296, we obtain 


S,, = 10(2,085.31) — (142.3)? = 603.81 
Si = 10(2,434.69) — (142.3)(166.8) = 611.26 
S,, = 10(2,897.80) — (166.8)? = 1,155.76 


Thus, 


= 611.26 _ 0.73 
~— ./(603.81)(1,155.76) 

which means that 100r? = 53 percent of the variation among the 

afternoon times is explained by (is accounted for or may be attributed 

to) corresponding differences among the morning times. 
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Whenever a value of r is based on sample data, we can perform a test 
of significance (a test of the null hypothesis p = p) or construct a con- 
fidence interval for p. For random samples from a bivariate normal 
population, this is usually based on the Fisher Z transformation, namely, 
the transformation from r to 











as it can be shown that the sampling distribution of this statistic is approxi- 


mately normal with the mean “lz = In — a 7 and the variance ° = 3° 


Thus, we can test the null hypothesis p = py with the statistic 











z= is me eed 3 ip hE OU — po) 
1/,/n — 3 (1 —r)( + po) 





which has approximately the standard normal distribution. In particular, 
we can test the null hypothesis p = 0 with the statistic 








fs pee ie 


p l—r 





These tests are facilitated by the use of Table 11, which gives the values 
of Z corresponding to r = 0.00, 0.01, 0.02,..., and 0.99. When r is nega- 
tive, we look up the value of Z corresponding to —r, and then take —Z. 


Continuation of Example. Returning to the example where we had 
r = 0.73, let us test the null hypothesis p = 0 at the level of signi- 
ficance o& = 0.05. Since the value of Z corresponding to r = 0.73 
is 0.929, the test statistic becomes 


z= ./n — 3(0.929) = ./10 — 3(0.929) = 2.46 


which exceeds z) 525 = 1.96. Thus, the null hypothesis p = 0 can be 
rejected and we conclude that there is a relationship between the 
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amount of time a mechanic takes to assemble the piece of machinery 
in the morning and in the late afternoon. 


To construct a confidence interval for p, we first construct a confidence 
interval for wz, the mean of the sampling distribution of Z, and then 
convert to r and p by means of Table 11. Making use of the theory on 
page 325, we can write the first of these confidence intervals as 





a/2 


__ Z «/2 Z 
Z 0 a ee eee 








Example. Suppose that for a given set of paired data (pertaining, say, 
to the mathematics and physics grades of 30 students) we obtained 
r = 0.70. Looking up the value of Z which corresponds to r = 0.70 
in Table 11, we get 0.867, and substituting this value together with 
n = 30 and z,,. = 1.96 into the above formula, we get the 0.95 con- 
fidence interval 


VS i em es (Fy 


at nf OF 
or 


0.490 < wz < 1.244 


Then, looking up the values of r for which Z is closest to 0.490 and 
1.244, respectively, we get the 0.95 confidence interval 


0.45 < p < 0.85 


for the “true” strength of the linear relationship between the grades 
of students in the two given subjects. 


Note that the confidence interval obtained in this example is fairly wide, 
which illustrates the fact that correlation coefficients based on relatively 
small samples are generally not very reliable. 


Example. To give an example involving negative values of r and Z, 
suppose that r= 0.20 and m = 40 for a given set of paired data. 
Since the value of Z which corresponds to r = 0.20 is 0.203 according 
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to Table 11, we first get the 0.95 confidence interval 


1.96 1.96 


or 
0419 = pe < 0,525 


Then, looking up the values of r for which Z is closest to 0.119 and 
0.525, respectively, we get the 0.95 confidence interval 


—0.12 < p < 0.48 


There are several serious pitfalls in the interpretation of the coefficient 
of correlation. First, it must be emphasized that r is an estimate of the 
strength of the /inear association between the random variables; thus, as 
illustrated in Figure 11.8, r may be close to zero when there is actually a 
strong (but nonlinear) relationship between the two random variables. 





Figure 11.8. Nonlinear relationship where correlation coefficient is zero. 


Second, and perhaps of greatest importance, a significant correlation does 
not necessarily imply a causal relationship between the two random vari- 
ables. Although it would not be surprising, for example, to obtain a high 
positive correlation between the annual sales of chewing gum and the 
incidence of crime in the United States, one cannot conclude that crime 
might be reduced by prohibiting the sale of chewing gum. Both variables 
depend upon the size of the population, and it is this mutual relationship 
with a third variable (population size) which produces the positive cor- 
relation. : 
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EXERCISES 


1. The following data pertain to the resistance (ohms) and the failure time 
(minutes) of certain overloaded resistors: 


Resistance Failure time Resistance Failure time 
43 32 36 36 
29 20 39 33 
44 45 36 21 
33 35 47 44 
33 22 28 26 
47 46 40 45 
34 28 42 39 
31 26 33 25 
48 37 46 36 
34 33 28 25 
46 47 48 45 
a7 30 45 36 


(a) Calculate the coefficient of correlation. 
(b) Test the null hypothesis p = 0 at the level of significance & = 0.05. 


2. Calculate r for the air velocities and evaporation coefficients of the example 
on page 294. Assuming that the necessary assumptions can be met, test for 
significance at & = 0.01. 


3. The following are measurements of the carbon content and the permeability 
index of 22 sinter mixtures: 


Carbon Permeability Carbon Permeability 


content (%) index content (%) index 
4.4 12 4.1 13 
ee 14 4.9 19 
4.2 18 4.7 22 
3.0 35 5.0 20 
4.5 23 4.6 16 
4.9 29 3.6 Zt 
4.6 16 4.9 21 
5.0 12 5.1 13 
4.7 18 4.8 18 
3.1 eal Sok ly 
4.4 ay fe) 11 


(a) Calculate the coefficient of correlation. 
(b) Find 99 percent confidence limits for p. 
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4, Evaluating the necessary integrals, verify the identities 


10. 


11. 


12. 


Uo =4+ Bu, and o% =o + B20? 
on page 322. 


. Substitute “. = &% + Bu, and 03 = a2 + $207 into the formula for the 


bivariate density given on page 322, and show that this gives the final form 
shown on page 323. 


. Show that for the bivariate normal distribution (a) independence implies 


zero correlation, and (b) zero correlation implies indepedence. 


. By substituting the sample variance of the y’s for 03 and s? for o% in the 


formula p2 = 1 — 02/032, verify the formula for r given on page 323 (except 
for a multiplicative constant). 


. If data on the ages and prices of 25 pieces of equipment yielded r = —0.57, 


test the null hypothesis p = 0 at the level of significance & = 0.01. 


. If for certain paired data nm = 18 and r = 0.44, test the null hypothesis 


p = 0.30 against the alternative hypothesis p > 0.30 at the level of signifi- 
cance & = 0.05. 


Construct a 0.95 confidence interval for p when the necessary assumptions 
are met and 

(a) yr = 0.75 and zn = 15; (b) r = 0.16 and n = 25; 

(c) r = —0.57 and n = 32. 


Construct a 0.99 confidence interval for p when the necessary assumptions 
are met and 

(a) r = 0.82 and n = 19; (b) r = 0.12 and n = 35; 

(c) r= —0.20 and n = 14. 


Instead of using the computing formula on page 323, the correlation coef- 
ficient r may be obtained from the formula 


_ pi dear ae 
= + f1 -Ge—2y Xv -— yy)? 


which is analogous to the formula used to define p. Although the computa- 
tions required by the use of this formula are tedious, the formula has the 
advantage that it can be used also to measure the strength of nonlinear 
relationships or relationships in several variables. For instance, in the multi- 
ple linear regression example on page 314, one could calculate the predicted 
values by means of the equation 


y = 48,2 + 7.83%; — 1.76x2 


and then determine r as a measure of how strongly y, the twist required to 
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13. 


14. 


break one of the forged alloy bars, depends on both percentages of alloying 

elements present. 

(a) Using the data on page 314, find 5} (y — y)? by means of the formula 
2.00 =I) =P 

(b) Using the regression equation obtained on page 315, calculate y’ for the 
sixteen points and then determine }) (y — y’)?. 

(c) Substitute the results obtained in (a) and (b) into the above formula for 
r. The result is the so-called multiple correlation coefficient. 


When we substitute for the actual values of paired data the ranks which the 
values occupy in the respective samples, the correlation coefficient can be 
written in the form 


; 6: >) a? 


OE Ge =T) 


in which it is called the rank correlation coefficient. In this formula, d, is the 

difference between the ranks of the paired observations (x;, y;) and n is the 

number of pairs in the sample. (In case of ties, we substitute for each of 

the tied observations the mean of the ranks which they jointly occupy, 

as in Section 10.3.) 

(a) Calculate r’ for the example on page 324, which deals with the time it 
takes mechanics to assemble a piece of machinery in the morning and 
in the late afternoon. Compare with the result obtained for r. 

(b) Calculate r’ for the data of Exercise 1 and compare with the value of r 
previously obtained. 

(c) Calculate r’ for the air velocities and evaporation coefficients of the 
example on page 294 and compare with the result obtained for r in 
Exercise 2. 


Since the assumptions on which the significance test of Section 11.5 are 
based are rather stringent, it is often desirable to use a nonparametric alter- 
native based on the rank correlation coefficient of Exercise 13. It is based on 
the theory that if there is no relationship between the x’s and the y’s (in fact, 
if they are randomly matched), the sampling distribution of r’ can be approxi- 
mated closely with a normal distribution having the mean 0 and the standard 
deviation 





Use the level of significance & = 0.01 to test the significance of the values of 
r’ obtained in the three parts of Exercise 13. 








Analysis 
of Variance 


Ve 


Some of the examples of Chapter 11 have already taught us that consider- 
able economies in calculation can result by appropriately planning an 
experiment in advance. What is even more important, proper experimen- 
tal planning can give a reasonable assurance that the results of an experi- 
ment will provide clear-cut answers to questions under investigation. While 
it is impossible to give in this chapter a complete discussion of experimental 
design, including the many pitfalls to which the experimenter is exposed, 
we shall begin by presenting in this section some of the general principles 
of experimental design. Several of the designs most frequently used in 
engineering and other applied research will be given in subsequent 
Sections. 
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Example. To illustrate some of the most important aspects of experi- 
mental design, let us consider the following situation. Suppose a steel 
mill supplies tin plate to three can manufacturers, the major specifi- 
cation being that the tin-coating weight should be at least 0.25 pound 
per base box. The mill and each can manufacturer has a laboratory 
where measurements are made of the tin-coating weights of samples 
taken from each shipment. Suppose, also, that some disagreement has 
arisen about the actual tin-coating weights of the tin plate being 
shipped, and it is decided to plan an experiment to determine whether 
the four laboratories are making consistent measurements. A com- 
plicating factor is that part of the measuring process consists of the 
chemical removal of the tin from the surface of the base metal; thus, it 
is impossible to have the identical sample measured by each laboratory 
to determine how closely the measurements correspond. 

One possibility is to send several samples (in the form of circular 
discs having equal areas) to each of the laboratories. Although these 
discs may not actually have identical tin-coating weights, it is hoped 
that such differences will be small and that they will more or less 
“average out.” In other words, it will be assumed that whatever dif- 
ferences there may be among the means of the four samples can be 
attributed to no other causes but systematic differences in measuring 
techniques and chance variability. This would make it possible to 
determine whether the results produced by the laboratories are con- 
sistent by comparing the variability of the four sample means with an 
appropriate measure of chance variation. 

Now there remains the problem of deciding how many discs are to be 
sent to each laboratory and how the discs are actually to be selected. The 
question of sample size can be answered in many different ways, one of 
which is to use the formula on page 216 for the standard deviation of 
the sampling distribution of the difference between two means. Sub- 
stituting known values of 0, and o, and specifying what differences 
between the true means of any two of the laboratories should be 
detected with a probability of at least 0.95 (or 0.98, or 0.99), it is pos- 
sible to determine n, = n, = n (see Exercise 10 on page 345). Suppose 
that this method and, perhaps, also considerations of cost and avail- 
ability of the necessary specimens lead to the decision to send a sample 
of 12 discs to each laboratory. 

The problem of selecting the required 48 discs and allocating 12 to 
each laboratory is not as straightforward as it may seem at first. To begin 
with, suppose that a sheet of tin plate, sufficiently long and sufficiently 
wide, is selected and that the 48 discs are cut as shown in Figure 12.1. 
The twelve discs cut from strip 1 are sent to the first laboratory, the 
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Figure 12.1. Numbering of tin-plate samples. 


twelve discs from strip 2 are sent to the second laboratory, and so 
forth. If the four mean coating weights subsequently obtained were 
then found to differ significantly, would this allow us to conclude that 
these differences can be attributed to lack of consistency in the mea- 
suring techniques? Suppose, for instance, that additional investigation 
shows that the amount of tin deposited electrolytically on a long sheet 
of steel has a distinct and repeated pattern of variation perpendicular 
to the direction in which it is rolled. (Such a pattern might be caused by 
the placement of electrodes, “edge effects,” and so forth.) Thus, even 
if all four laboratories measured the amount of tin consistently and 
without error, there could be cause for differences in the tin-coating 
weight determinations. The allocation of an entire strip of discs to each 
laboratory is such that inconsistencies among the laboratories’ mea- 
suring techniques are inseparable from (or confounded with) whatever 
differences there may be in the actual amount of tin deposited perpen- 
dicular to the direction in which the sheet of steel is rolled. 

One way to avoid this kind of confounding is to number the discs 
and allocate them to the four laboratories at random, such as in the 
following arrangement, which was obtained with the aid of a table of 
random numbers: 


Laboratory A: 3, 38, 17, 32, 24, 30, 48, 19, 11, 31, 22, 41 
Laboratory B: 44, 20, 15, 25, 45, 4, 14, 5, 39, 7, 40, 34 
Laboratory C: 12, 21, 42, 8, 27, 16, 47, 46, 18, 43, 35, 26 
Laboratory D: 9, 2, 28, 23, 37, 1, 10, 6, 29, 36, 33, 13 


If there were any actual pattern of tin-coating thickness on the sheet of 
tin plate, it would be “broken up” by the randomization. 
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Although we have identified and counteracted one possible sys- 
tematic pattern of variation, there is no assurance that there can be no 
others. For instance, there may be systematic differences in the areas of 
the discs caused by progressive wear of the cutting instrument, or there 
may be scratches or other imperfections on one part of the sheet which 
could affect the measurements. Thus, there is always the possibility that 
differences in means attributed to inconsistencies among the labora- 
tories are actually caused by some other uncontrolled variable, and it is 
the purpose of randomization to avoid confounding the variable under 
investigation with such other variables. 

By distributing the 48 discs among the four laboratories entirely at 
random, we have no choice but to include whatever variation may be 
attributable to extraneous causes under the heading of “chance varia- 
tion.” This may give us an excessively large estimate of chance variation 
which, in turn, may make it difficult to detect differences between the 
true laboratory means. In order to avoid this, we could, perhaps, use 
only discs cut from the same strip (or from an otherwise homogeneous 
region). Unfortunately, this kind of controlled experimentation presents 
us with new complications. Of what use would it be, for example, to 
perform an experiment which allows us to conclude that the labora- 
tories are consistent (or inconsistent), if such a conclusion is limited to 
measurements made at a fixed distance from one edge of a sheet? To 
consider a more poignant example, suppose that a manufacturer of 
plumbing materials wishes to compare the performance of several kinds 
of material to be used in underground water pipes. If such conditions 
as soil acidity, depth of pipe, and mineral content of water were all 
held fixed, any conclusions as to which material is best would be valid 
only for the given set of conditions. What the manufacturer really 
wants to know is which material is best over a fairly wide variety of 
conditions, and in designing a suitable experiment it would be advisable 
(indeed, necessary) to specify that pipe of each material be buried at 
each of several depths in each of several kinds of soil, and in locations 
where the water varies in hardness. 


The foregoing example serves to illustrate that it is seldom desirable to 
hold all or most extraneous factors fixed throughout an experiment in 
order to obtain an estimate of chance variation that is not “inflated” by 
variations due to other causes. (In fact, it is rarely, if ever, possible to 
exercise such strict control, that is, to hold a// extraneous variables fixed.) 
In actual practice, experiments should be planned so that known sources of 
variability are deliberately varied over as wide a range as necessary; fur- 


335 12.2 Completely Randomized Designs 


thermore, they should be varied in such a way that their variability can be 
eliminated from the estimate of chance variation. One way to accomplish 
this is to repeat the experiment in several blocks, where known sources of 
variability (that is, extraneous variables) are held fixed in each block, but 
vary from block to block. 


Continuation of Example. In the tin-plating problem we might thus 
account for variations across the sheet of steel by randomly allocating 
three discs from each strip to each of the laboratories as in the following 
arrangement: 





Strip 1 Strip 2 Strip 3 Strip 4 
Laboratory A: 8, 4, 10 23, 24, 19 26, 29, 35 37, 44, 48 
Laboratory B: 2, 6, 12 21 Loy 22 34, 33, 32 45, 43, 46 
Laboratory C: 1, 5, 11 16, 20, 13 36, 29, 30 41, 38, 47 
Laboratory D: 7, 3, 9 17, 18, 14 28, 31, 25 39, 40, 42 


In this experimental layout, the strips form the blocks, and if we base 
our estimate of chance variation on the variability within each of the 
sixteen sets of three discs, this estimate will not be inflated by the 
extraneous variable, that is, differences among the strips. (Note also 
that, with this arrangement, differences among the means obtained for 
the four laboratories cannot be attributed to differences among the 
strips. This cannot be said for the arrangement on page 333.) 


The analysis of experiments in which blocking is used to eliminate one 
source of variability will be discussed in Section 12.3. The analysis of 
experiments in which two or three sources of variability are thus eliminated 
will be treated in Section 12.5. 


12.2 COMPLETELY RANDOMIZED DESIGNS 


In this section we shall consider, in general, the statistical analysis of 
the completely randomized design, or one-way classification. We shall sup- 
pose that the experimenter has available the results of k independent 
random samples, each of size n, from k different populations (that is, data 
concerning & treatments, k groups, k methods of production, etc.); and he 
is concerned with testing the hypothesis that the means of these k popula- 
tions are all equal. An example of such an experiment, with k = 4, is given 
by the layout on page 333. If we denote the jth observation in the ith 
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sample by y,;, the general schema for a one-way classification is as follows: 


Means 
Sample 13 9445 Vi2e<++oJ1j.e% is F tm V1 
Sample 2: 21s V220-++sV2fo000s Van y2 
Sample i: _Vits Vids 0s Vify oo +s Vin Yi 
Sample ki: Viety Vd» eee ys VK joe +9 Vin Yk 
y. 


With reference to the experimental layout on page 333, y,; i = 1, 2, 3, 4; 
j=1,2,..., 12) is the jth tin-coating weight measured by the ith labora- 
tory, y, is the mean of the measurements obtained by the ith laboratory, 
and j. is the overall mean (or grand mean) of all 48 observations. 

To be able to test the hypothesis that the samples were obtained from 
k populations with equal means, we shall make several assumptions. 
Specifically, it will be assumed that we are dealing with normal populations 
having equal variances. There exist methods for testing the reasonableness 
of this last assumption (see the book by A. M. Mood and F. A. Graybill 
mentioned in the Bibliography), but the methods we shall develop in this 
chapter are fairly robust; that is, they are relatively insensitive to violations 
of the assumption of normality as well as the assumption of equal variances. 

If wu, denotes the mean of the ith population and o” denotes the com- 
mon variance of the k populations, we can express each observation y;,,; as 
lu; plus the value of a random component; that is, we can write 


Vi = Ma + Gy fOr? 24, 2c ccc tS Lf 


In accordance with the above assumptions, the €;, are values of indepen- 
dent, normally distributed random variables with zero means and the 
common variance o*.* 

To attain uniformity with corresponding equations for more com- 
plicated kinds of designs, it is customary to replace wu; by uw + @,, where u 


k 
is the mean of the yw, and, hence, 5) a; = 0 (see Exercise 11 on page 
i=1 


*Note that this equation, or model, can be regarded as a multiple regression equa- 
tion; introducing the variables x;; which equal 0 or 1 depending on whether the two 
subscripts are unequal or equal, we can write 

Vig = MiXi1 + Max%i2 +... UX + tj 


The parameters yw; can now be interpreted as regression coefficients, and they can be 
estimated by the least-squares methods of Chapter 11. 
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345). Using these new parameters, we can write the model equation for the 
one-way classification as 





=f t, + €;; CG) iy a coe ad et 


and the null hypothesis that the k population means are all equal can be 
replaced by the null hypothesis thata, =a, =... = 0, = 0. Thealterna- 
tive hypothesis that at least two of the population means are unequal is 
equivalent to the alternative hypothesis that «, + 0 for some i. 

To test the null hypothesis that the k population means are all equal, 
we shall compare two estimates of g*—one based on the variation among 
the sample means, and one based on the variation within the samples. 
Since, by assumption, each sample comes from a population having the 
variance o”, this variance can be estimated by any one of the sample 
variances 


(Vis — Vi)? 
= a ae 


and, hence, also by their mean 


Note that each of the sample variances s? is based on n — 1 degrees of 
freedom (n — 1 independent deviations from j,) and, hence, 62, is based 
on k(n — 1) degrees of freedom. Now, the variance of the k sample means 
is given by 


: ) 
Pas So 


and if the null hypothesis is true it estimates o7/n. Thus, an estimate of a? 

based on the differences among the sample means is given by 

(y: — y.)” 
| 


61 ps) = ne = 


we 


ll 
se 


I 


and it is based on k — 1 degrees of freedom. 
If the null hypothesis is true, it can be shown that 2, and 62 are in- 
dependent estimates of a”, and it follows that 
ee: 


A 


Oy 
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is a value of a random variable having the F distribution with k — 1 and 
k(n — 1) degrees of freedom. Since the between sample variance, 63, can 
be expected to exceed the within sample variance, Gj, when the null 
hypothesis is false, the null hypothesis will be rejected if F exceeds F,, 
where F, is obtained from Table 6 with k — 1 and k(m — 1) degrees of 
freedom. 

The preceding argument has shown how the test of the equality of k 
means can be based on the comparison of two variance estimates. More 
remarkable, perhaps, is the fact that the two estimates in question [except 
for the divisors k — 1 and k(n — 1)] can be obtained by “breaking up” or 
analyzing the total variance of all mk observations into two parts. The 
sample variance of all nk observations is given by 


and with reference to its numerator, called the total sum of squares, we 
shall now prove the following theorem. 





THEOREM 12.1 


n 


oS YOu - I) p> Su -5 +0 YG IP 


I=1 j 





The proof of this theorem is based on the identity 
Vip — I= VG II+G:-J) 


Squaring both sides and summing on i and j, we obtain 


k 
+2), YOu VIMY; — Y-) 
Next, we observe that 
k on : _ = k n 
by 24 = Ir I) = 2 DD Bah fT ae P)J=9 


since y, is the mean of the ith sample and, hence, 3 (y;; —¥p) = 0 for 
j=1 


all i. To complete the proof of Theorem 12.1, we have only to observe that 
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the summand of the second sum on the right-hand side of the original 
identity does not involve the subscript j and that, consequently, 


7% 


YY GIP =e EGFP 


It is customary to denote the total sum of squares, the left-hand member 
of the identity of Theorem 12.1, by SST. The first term on the right-hand 
side is G2, times its degrees of freedom, and we refer to this sum as the 
error sum of squares, SSE. The term “error sum of squares” expresses the 
idea that the quantity estimates random (or chance) error. The second term 
on the right-hand side of the identity of Theorem 12.1 is G2 times its degrees 
of freedom, and we refer to it as the between-samples sum of squares or the 
treatment sum of squares, SS(Tr). (Most of the early applications of this 
kind of analysis were in the field of agriculture, where the k populations 
represented different treatments, such as fertilizers, applied to agricultural 
plots.) Note that in this notation the F ratio on page 337 can be written 





— SS(Tr)/(k — 1) 


f= eR — 1). 


The sums of squares required for substitution into this last formula are 
usually obtained by means of the following short-cut formulas, which the 
reader will be asked to verify in Exercise 12 on page 345. We first calculate 
SST and SS(Tr) by means of the formulas 





where C, called the correction term, is given by 









T 2 


C= 


In these formulas, T, is the total of the n observations in the ith sample 
while 7 is the grand total of all kn observations. The error sum of squares, 
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SSE, is then obtained by subtraction; according to Theorem 12.1 we can 
write 


SSE = SST — SS(Tr) 





The results obtained in analyzing the total sum of squares into its 
components are conveniently summarized by means of the following kind 
of analysis of variance table: 


Source of Degrees of | Sum of 
variation freedom squares Mean square 


Treatments MS(Tr) MS(Tr) 
= SS(Tr)/(k — 1) 


= SSE/k(a — 1) 





Note that each mean square is obtained by dividing the corresponding sum 
of squares by its degrees of freedom. 


Example. To illustrate the analysis of variance (as this technique is 
appropriately called) for a one-way classification, suppose that in 
accordance with the layout on page 333 each laboratory measures the 
tin-coating weights of 12 discs and that the results are as follows: 


Laboratory A Laboratory B Laboratory C Laboratory D 





0.25 0.18 0.19 0.23 
0.27 0.28 0.25 0.30 
0.22 0.21 0.27 0.28 
0.30 0.23 0.24 0.28 
0.27 0.25 0.18 0.24 
0.28 0.20 0.26 0.34 
0.32 0.27 0.28 0.20 
0.24 0.19 0.24 0.18 
0.31 0.24 0.25 0.24 
0.26 0.22 0.20 0.28 
0.21 0.29 O25 0.22 


0.28 0.16 0.19 0.24 
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The totals for the four samples are, respectively, 3.21, 2.72, 2.76, and 
3.00, the grand total is 11.69, and the calculations required to obtain 
the necessary sums of squares are as follows: 


(11.69)? _ 
Ca = 2.8470 
SST = (0.25)? + (0.27)? + ... + (0.21)? — 2.8470 = 0.0809 


CC ee oa ee i a) i + C.76)" + C.00)" — 9.8470 = 0.0130 


SSE = 0.0809 — 0.0130 = 0.0679 





Thus, we get the following analysis of variance table: 


Source of Degrees of Sum of Mean 
variation freedom squares square 


Laboratories 





Since the value obtained for F exceeds 2.82, the value of F,,; with 3 
and 44 degrees of freedom, the null hypothesis can be rejected at the 
0.05 level of significance; we conclude that the laboratories are not 
obtaining consistent results. 


To estimate the parameters i, &1, %», %3, and 0, (Or 11, Ua, 3, and L,), 
we can use the method of least squares, minimizing 


n 


k 
>» ps (Viz; —  — 0)? 


k 
with respect to w and the a;, subject to the restriction that 5} a; = 0. 
t=1 


This may be done by eliminating one of the «’s or, better, by using the 
method of Lagrange multipliers, which is treated in most texts on advanced 
calculus. In either case we obtain the “intuitively obvious” estimates @ = 
y, and &,=y;, —y, fori=1,2,...,k, and the corresponding estimates 
for the yw; are given by #2; = j,. 


Continuation of Example. Thus, for the data from the four labora- 
tories we get @ = “EO? = 0,244, &, = =2' — 0.244 = 0.024, @, = 
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_ —0.244=—0.017, &, = _ 024001 ade: = 
5e 0.244 = 0.006. 


The analysis of variance described in this section applies to one-way 
classifications where each sample has the same number of observations. 
If this is not the case and the sample sizes are n,,n,.,..., andn,, we have 


k 

only to substitute NV = >) n, for nk throughout and write the comput- 
i=l 

ing formulas for SST and S'S(7Tr) as 


kK nm 
SST = 2 pb teen © 


i=1 j=1 





Otherwise, the procedure is the same as before. (See also Exercise 13 on 
page 345.) 


EXERCISES 


1. An experiment is performed to compare the cleansing action of two deter- 
gents, Detergent A and Detergent B. Twenty swatches of cloth are soiled 
with dirt and grease, each is washed with one of the detergents in an agitator- 
type machine, and then measured for “whiteness.” Criticize the following 
aspects of the experiment: 

(a) The entire experiment is performed with soft water. 

(b) Fifteen of the swatches are washed with Detergent A and five with 
Detergent B. 

(c) To accelerate the testing procedure, very hot water and 30-second wash- 
ing times are used in the experiment. 

(d) The “whiteness” readings of all swatches washed with Detergent A are 
taken first. 


~ 


A certain bon vivant, wishing to ascertain the cause of his frequent hangovers, 
conducted the following experiment. On the first night, he drank nothing 
but whiskey and water; on the second night he drank vodka and water; on 
the third night he drank gin and water; and on the fourth night he drank rum 
and water. On each of the following mornings he had a hangover, and he 
concluded that it was the common factor, the water, that made him ill. 
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(a) This conclusion is obviously unwarranted, but can you state what prin- 
ciples of sound experimental design are violated ? 

(b) Give a less obvious example of an experiment having the same short- 
comings. 

(c) Suppose our friend had modified his experiment so each of the four 
alcoholic beverages was used both with and without water, so that the 
experiment lasted eight nights. Could the results of this enlarged experi- 
ment serve to support or refute the hypothesis that water was the cause 
of the hangovers? Explain. 


3. To compare the effectiveness of three methods of teaching the programming 
of acertain computer—Method A which is straight teaching-machine instruc- 
tion, Method B which involves the personal attention of an instructor and 
some direct experience working with the computer, and Method C which 
involves the personal attention of an instructor but no work with the com- 
puter—random samples of size four are taken from large groups of persons 
taught by the three methods and the following are the scores which they 
obtained in an appropriate achievement test: 


Method A Method B Method C 


73 91 te 
vy 81 77 
67 87 76 
71 85 2 


kK on 
(a) Without using the short-cut formulas, calculate >) }) 0, — 7), 
t=1 j=1 


k k 
>» DY Oy —y)?, and n> (¥; —y.)?, and verify the identity of 
i=1 


i=1 j=l 
Theorem 12.1. 

(b) Verify the results obtained for the three sums of squares by using the 
short-cut formulas on pages 339 and 340. 


4. Using the sums of squares obtained in Exercise 3, test at the level of signifi- 
cance & = 0.05 whether the differences among the means obtained for the 
three samples are significant. 


5. The following are the number of mistakes made in five successive days by 
four technicians working for a photographic laboratory: 


Technician I Technician IT Technician II Technician IV 





6 14 10 9 
14 9 12 12 
10 i2 fi 8 

8 10 15 10 


11 14 11 El 


Test at the level of significance & = 0.01 whether the differences among the 
four sample means can be attributed to chance. 
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6. The following are the weight losses of certain machine parts (in milligrams) 
due to friction, when three different lubricants were used under controlled 
conditions: 


Lubricant A: 12.2, 11.8, 13.1, 11.0, 3.9, 4.1, 10.3, 8.4 
Lubricant B: 10.9, 5.7, 13.5, 9.4, 11.4, 15.7, 10.8, 14.0 
Lubricant C: 12.7, 19.9, 13.6, 11.7, 18.3, 14.3, 22.8, 20.4 


(a) Test at the 0.01 level of significance whether the differences among the 
sample means can be attributed to chance. 

(b) Estimate the parameters of the model used in the analysis of this experi- 
ment. 


7. To determine the effect on exit dust loading in a precipitator, the following 
measurements were made: 





Total flow Exist dust loading 

(cu ft/hr) (grains per cubic yard in flue gas) 
200 1.5 is? 1.6 1.9 1.9 
300 Le 1.8 2.2 1.9 z2 
400 1.4 1.6 1 1.5 Ls 
500 1.1 13 1.4 1.4 2.0 


Use the level of significance & = 0.05 to test whether the flow through the 
precipitator has an effect on the exit dust loading. 


8. To study the performance of a newly-designed motorboat it was timed over 
a marked course under various wind and water conditions. Use the following 
data (in minutes) to test the null hypothesis that the boat’s performance is 
not affected by the differences in wind and water conditions: 


Calm conditions: 20, 17, 14, 24 
Moderate conditions: 21, 23, 16, 25, 18, 23 
Choppy conditions: 26, 24, 23, 29, 21 


Use the level of significance & = 0.05. 


9, To find the best arrangement of instruments on a control panel of an air- 
plane, three different arrangements were tested by simulating an emergency 
condition and observing the reaction time required to correct the condition. 
The reaction times (in tenths of a second) of 28 pilots (randomly assigned to 
the different arrangements) were as follows: 


Arrangement 1: 14, 13, 9, 15, 11, 13, 14, 11 
Arrangement 2: 10, 12, 9, 7, 11, 8, 12, 9, 10, 13, 9, 10 
Arrangement 3: 11, 5, 9, 10, 6, 8, 8, 7 
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Test at the level of significance & = 0.01 whether we can reject the null 
hypothesis that the differences among the arrangements have no effect. 


10. Referring to the discussion on page 332, assume that the standard deviations 
of the tin-coating weights determined by any one of the three laboratories 
have the common value 0 = 0.012, and that it is desired to be 95 percent 
confident of detecting a difference in means between any two of the labora- 
tories in excess of 0.01 pound per base box. Show that these assumptions 
lead to the decision to send a sample of 12 discs to each laboratory. 


11. Show that if “4; = w+ 4;, and mu is the mean of the y;, it follows that 
k 
> a; = 0. 
i=1 

12. Verify the short-cut formulas for computing SST and SS(Tr) given on page 
339. 


13. State and prove a theorem analogous to Theorem 12.1 for the case where the 
size of the ith sample is n;; that is, where the sample sizes are not necessarily 
equal. 

14. Samples of peanut butter produced by two different manufacturers are 
tested for aflatoxin content, with the following results: 


Aflatoxin content (ppb) 


Brand A Brand B 
0.5 4.7 
0.0 GZ 
3,2 0.0 
1.4 10.5 
0.0 2.1 
tO 0.8 
8.6 
2.9 


(a) Use the method analysis of variance to test whether the two brands differ 
in aflatoxin content. 

(b) Test the same hypothesis using a two-sample f-test. 

(c) It can be shown that the ¢ and F statistics were related by the formula 


F(1,v) = t2() 


where v = degrees of freedom. Using this result, prove that the analysis 
of variance and two-sample f-test methods are equivalent in this case. 
12.3 RANDOMIZED BLOCK DESIGNS 


As we observed in Section 12.1, the estimate of chance variation (the 
experimental error) can often be reduced, that is, freed of variability due to 
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extraneous causes, by dividing the observations in each classification into 
blocks. This is accomplished when known sources of variability (that is, 
extraneous variables) are held fixed in each block, but vary from block to 
block. 

In this section we shall suppose that the experimenter has available 
measurements pertaining to a treatments distributed over b blocks. First, 
we shall consider the case where there is exactly one observation from each 
treatment in each block; with reference to the illustration on page 335, 
this case would arise if each laboratory tested one disc from each strip. 
Letting y,, denote the observation pertaining to the 7th treatment and the 
jth block j, the mean of the b observations for the ith treatment, y_, the 
mean of the a observations in the jth block, and y,, the grand mean of all 
the ab observations, we shall use the following layout for this kind of 
two-way Classification: 


Blocks 
B, Bz ee doy seg “Dk Means 
Treatment I: V14ts Vidya« so Vife sess PIB Vi. 
Treatment 2: V5 VI0s cs og POjfe ws » 9 VOB y2. 
Treatment i: Wiis Vee ssne Vife 6 ona VR Ji. 
Treatment a: Void Vitis 0a Dats cw nn deb Ya. 
Means Yur Via ees Vag vee Vo Ve 


This kind of arrangement is also called a randomized block design, since the 
treatments are presumably allocated at random within each block. Note 
that when a dot is used in place of a subscript, this means that the mean is 
obtained by summing over that subscript. 

The underlying model which we shall assume for the analysis of this 
kind of experiment with one observation per “cell” (that is, there is one 
observation corresponding to each treatment within each block) is given 
by 





Pip = DP Oe Pe Pep (Ott = 1 2e 45058 SS Lo dcese? 





Here y is the grand mean, «; is the effect of the ith treatment, 8, is the 
effect of the jth block, and the e€;, are values of independent, normally 
distributed random variables having zero means and the common variance 
o?. Analogous to the model for the one-way classification, we restrict the 
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a b 
parameters by imposing the conditions that }) «,=0 and >> #B, =0 
i=1 j=1 


(see Exercise 10 on page 356). 

In the analysis of a two-way classification where each treatment is 
represented once in each block, the major objective is to test for the signifi- 
cance of the difference among the y,, that is, to test the null hypothesis 


Gp S05 .5.=% =V 


In addition, it may also be desirable to test whether the blocking has been 
effective, that is, whether the null hypothesis 


6, =f,=...=f,=0 


can be rejected. In either case, the alternative hypothesis is that at least one 
of the effects is different from zero. 

As in the one-way analysis of variance, we shall base these significance 
tests on comparisons of estimates of @*—one based on the variation among 
treatments, one based on the variation among blocks, and one measuring 
the experimental error. Note that only the latter is an estimate of gd? when 
either (or both) of the null hypotheses do not hold. The required sums of 
squares are given by the three components into which the total sum of 
squares is “broken up” by means of the following theorem: 


‘THEOREM 129 


a 


SY Ou 9.7 = Y0u-3 Vi —yyty..)? 


b 
+b. - 3. $4 MGs — 9.) 


The left-hand side of this identity represents the total sum of squares, SST, 
and the terms on the right-hand side are, respectively, the error sum of 
squares, SSE, the treatment sum of squares, SS(7r), and the block sum of 
squares SS(B/). To prove this theorem, we make use of the identity 


Vu L. = (yy — oe +yVI+G:.-ID+O3- J.) 


and follow essentially the same argument as in the proof of Theorem 12.1. 
In practice, we calculate the necessary sums of squares by means of 
short-cut formulas analogous to those on page 339, rather than by using 
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the expressions which define these sums of squares in Theorem 12.2. We 
first calculate SST, SS(Tr), and SS(B/) by means of the formulas 





SS(BI) = £ 






where C, the correction term, 


In these formulas T, is the sum of the b observations for the ith treatment, 
T_, is the sum of the a observations in the jth block, and T_, is the grand 
total of all the observations. Note that the divisors for SS(7r) and SS(B/) 
are the number of observations in the respective totals, 7; and T_,. The 
error sum of squares is then obtained by subtraction; according to 
Theorem 12.2 we can write 


SSB = SST = SSC) SSB) 


In Exercise 11 on page 356, the reader will be asked to verify that all these 
computing formulas are, indeed, equivalent to the corresponding terms of 
the identity of Theorem 12.2. 

Using these sums of squares, we can reject the null hypothesis that the 
a, are all equal to zero at the level of significance a if 


P= MS(Tr) _ _- SS(Tr)/(a — 1) 


MSE ~ SSE\(a — Ib — 1) 





exceeds F, with a — 1 and (a — 1)(6 — 1) degrees of freedom. The null 
hypothesis that the £, are all equal to zero can be rejected at the level of 
significance o if | 
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MS(BI) _—-SS(BI)/(b — 1) 
MSE ~ SSEa— Db—)D 


Fy = 





exceeds F, with b — 1 and (a — 1)(b — 1) degrees of freedom. Note that 
the mean squares, MS(Tr), MS(B/), and MSE, are again defined as the 
corresponding sums of squares divided by their degrees of freedom. 

The results obtained in this analysis are summarized in the following 
analysis of variance table: 


Source of Degrees of Sum of 
variation freedom squares Mean square F 


SS(Tr) | MS(Tr) = ae 
SS(BI) 


SS(BI) | MS(Bl) = G>* 





Example. Let us illustrate the analysis of a two-way classification with 
one observation for each treatment in each block, by considering an 
experiment designed to study the performance of four different deter- 
gents. The following “whiteness” readings were obtained with specially 
designed equipment for 12 loads of washing distributed over three 
different models of washing machines: 





Machine 1  Machine2  Mchine 3 Totals 
Detergent A 139 
Detergent B 145 
Detergent C 153 
Detergent D 128 
Totals 182 176 207 565 


Looking upon the detergents as treatments and the machines as blocks, 
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we obtain the necessary sums of squares as follows: 

363) 

~~ 12 

SST = 457 + 437+ ... + 49? = 26,867 — 26,602 = 265 

139% + 1452 + 153% + 1282 
3 


& = 26,002 


SS(Tr) = = 26,602 == 111 


1827 + 1767 + 2072 
4 


SSE = 265 — 111 — 135 = 19 


SS(Bl) = => 20,002 == 135 


Then dividing the sums of squares by their respective degrees of free- 
dom to obtain the appropriate mean squares, we get the results shown 
in the following analysis of variance table: 


Source of Degrees of Sums of Mean 
variation freedom squares square F 
Detergents 3 111 37.0 11.6 
Machines 2 135 67.5 21d 
Error 6 9 3:2 
Total iB! 265 


Since F,, = 11.6 exceeds 9.78, the value of Fy ,; with 3 and 6 degrees 
of freedom, we conclude that there are differences in the effectiveness of 
the four detergents. Also, since F,, = 21.1 exceeds 10.9, the value of 
F, 9, with 2 and 6 degrees of freedom, we conclude that the differences 
among the results obtained for the three washing machines are signi- 
ficant, namely, that the blocking has been effective. To make the effect 
of this blocking even more evident, the reader will be asked to verify 
in Exercise 6 on page 355 that the test for differences among the deter- 
gents would not yield significant results if we looked upon the data as 
a one-way classification. 

The effect of the ith detergent can be estimated by means of the 
formula &; = y; — y_, which may be obtained by the method of least 
squares. The resulting estimates are 


&, = 46.3 — 47.1 = —0.8, t= 483 = 47.1 = 12, 
&,; = 51.0 — 47.1 = 3.9, @, = 42.7 —47.1 = —4.4 
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It should be observed that a two-way classification automatically allows 
for repetitions of the experimental conditions; for example, in the pre- 
ceding experiment each detergent was tested three times. Further repeti- 
tions may be handled in several ways, and care must be taken that the 
model used appropriately describes the situation. One way to provide 
further repetition in a two-way classification is to include additional 
blocks—for example, to test each detergent using several additional wash- 
ing machines, randomizing the order of testing for each machine. Note that 
the model remains essentially the same as before, the only change being an 
increase in b, and a corresponding increase in the degrees of freedom for 
blocks and for error. The latter is important, because an increase in the 
degrees of freedom for error makes the test of the null hypothesis «7, = 0 
for all i more sensitive to small differences among the treatment means. In 
fact, the real purpose of this kind of repetition is to increase the degrees of 
freedom for error, thereby increasing the sensitivity of the F tests (see 
Exercise 9 on page 356). 

A second method is to repeat the entire experiment, using a new pattern 
of randomization to obtain a-b additional observations. This is possible 
only if the blocks are identifiable, that is, if the conditions defining each 
block can be repeated. For example, in the tin-coating weight experiment 
described in Section 12.1, the blocks are strips across the rolling direction 
of a sheet of tin plate, and, given a new sheet, it is possible to identify 
which is strip 1, which is strip 2, and so forth. In the example of this section, 
this kind of repetition (usually called replication) would require that the 
settings of the machines be exactly duplicated. This kind of repetition will 
be used in connection with Latin-square designs in Section 12.5; see also 
Exercises 7 and 8 on pages 355 and 356. 

A third method of repetition is to include n observations for each treat- 
ment in each block. When an experiment is designed in this way, the n 
observations in each “cell” are regarded as duplicates, and it is to be 
expected that their variability will be somewhat less than experimental 
error. To illustrate this point, suppose that the tin-coating weights of 
three discs from adjacent positions in a strip are measured in sequence by 
one of the laboratories, using the same chemical solutions. The variability 
of these measurements will probably be considerably less than that of three 
discs from the same strip measured in that laboratory at different times, 
using different chemical solutions, and perhaps different technicians. The 
analysis of variance appropriate for this kind of repetition reduces essen- 
tially to a two-way analysis of variance applied to the means of the n dupli- 
cates in the a-b cells; thus, there would be no gain in degrees of freedom for 
error, and, consequently, no gain in sensitivity of the F tests. It can be 
expected, however, that there will be some reduction in the error mean 
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square, since it now measures the residual variance of the means of several 
observations. 


12.4 MULTIPLE COMPARISONS 


The F tests used so far in this chapter showed whether differences 
among several means are significant, but they did not tell us whether a 
given mean (or group of means) differs significantly from another given 
mean (or group of means). In actual practice, the latter is the kind of 
information an investigator really wants; for instance, having determined 
on page 341 that the means of the tin-coating weights obtained by the four 
laboratories differ significantly, it may be important to find out which 
laboratory (or laboratories) differs from which others. 

If an experimenter is confronted with k means, it may seem reasonable 
at first to test for significant differences between all possible pairs, that is, to 


perform 4 — Mea) two-sample ¢ tests as described on page 218. 


Aside from the fact that this would require a large number of tests even if 
k is relatively small, these tests would not be independent, and it would be 
virtually impossible to assign an overall level of significance to this pro- 
cedure. 

Several multiple-comparisons tests have been proposed to overcome 
these difficulties, among them the Duncan multiple-range test, which we 
shall study in this section. (References to other multiple-comparisons tests 
are mentioned in the book by W. T. Federer listed in the Bibliography.) 
The assumptions underlying the Duncan multiple-range test are, essen- 
tially, those of the one-way analysis of variance for which the sample sizes 
are equal. The test compares the range of any set of p means with an 
appropriate Jeast significant range, R,, given by 


Here s, is an estimate of 0, = a/,/n, and it is computed by means of the 
formula 


F SE 
S, = 
n 
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where MSE is the error mean square in the analysis of variance. The value 
of r,, depends on the desired level of significance and the number of degrees 
of freedom corresponding to MSE, and it may be obtained from Tables 
12(a) and (b) for aw = 0.05 and 0.01, for p = 2, 3,..., 10, and for various 
degrees of freedom from 1 to 120. 


Example. To illustrate this multiple-comparisons procedure, let us 
refer to the tin-coating data on page 340 and arrange the four sample 
means as follows in an increasing order of magnitude: 


Laboratory B C D A 





Mean 0.227 0.230 0.250 0.268 


Next, we compute s,, using the error mean square of 0.0015 in the 
analysis of variance on page 341, and we obtain 


(0.0015 
Se = 4/45 = 0.011 


Then, we get (by linear interpolation) from Table 12(a) the following 
values of r, for « = 0.05 and 44 degrees of freedom: 


D ye 3 4 
is 2.85 3.00 3.09 


Multiplying each value of r, by s; = 0.011, we finally obtain 
Dp 2 3 4 
~~ Rp 0.031 0.033 0.034 


The range of all four means is 0.268 — 0.227 = 0.041, which 
exceeds R, = 0.034, the least significant range. This result should have 
been expected, since the F test on page 341 showed that the differences 
among all four means are significant at « = 0.05. To test for significant 
differences among three adjacent means, we obtain ranges of 0.038 and 
0.023, respectively, for 0.230, 0.250, 0.268 and 0.227, 0.230, 0.250. Since 
the first of these values exceeds R,; = 0.033, the differences observed in 
the first set are significant; since the second value does not exceed 0.033, 
the corresponding differences are not significant. Finally, for adjacent 
pairs of means we find that no adjacent pair has a range greater than 
the least significant range Re = 0.031. All these results can be sum- 
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marized by writing 
0.227 0.230 0.250 0.268 


where a line is drawn under any set of adjacent means for which the 
range is less than the appropriate value of R,, that is, under any set of 
adjacent means for which differences are not significant. We thus con- 
clude in our example that Laboratory A averages higher tin-coating 
weights than laboratories B and C. 


Example. If we apply this same method to the example of Section 12.3, 
where we compared the four detergents, we obtain (see also Exercise 12 
on page 356) 


Detergent 


D A B C 
42.7 46.3 48.3 51.0 


In other words, among triplets of adjacent means both sets of dif- 
ferences are significant. So far as pairs of adjacent means are concerned, 
we find that only the difference between 42.7 and 46.3 is significant. 
Interpreting these resuits, we conclude that detergent D is significantly 
inferior to any of the others. 


EXERCISES 


1. A laboratory technician measures the breaking strength of each of five kinds 
of linen threads by means of four different instruments, and obtains the fol- 
lowing results (in ounces): 


Measuring instruments 
I; In I; I4 


Thread 1 


Thread 2 


Thread 3 


Thread 4 


Thread 5 
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Looking upon the threads as treatments and the instruments as blocks, 
perform an analysis of variance at the level of significance & = 0.01. 


2. Looking at the days (rows) as blocks, rework Exercise 5 on page 343 by the 
method of Section 12.3. 


3. Four different, though supposedly equivalent, forms of a standardized 
reading achievement test were given to each of five students, and the follow- 
ing are the scores which they obtained: 


Student 1 Student 2 Student 3 Student 4 Student 5 


Form A 75 59 69 84 


Form D 73 67 62 719 95 


Perform a two-way analysis of variance to test at the level of significance 
& = 0.01 whether it is reasonable to treat the four forms as equivalent. 


4. An experiment was performed to judge the effect of four different fuels and 
two different types of launchers on the range of a certain rocket. Test, on the 
basis of the following data (in nautical miles) whether there are significant 
differences (a) among the means obtained for the fuels, and (b) between the 
means obtained for the launchers: 


Fuel I Fuel II Fuel II Fuel IV 


Launcher X 


Launcher Y 





Use the level of significance & = 0.05. 


5, Referring to Exercise 7 on page 344, suppose the data in each of the five 
columns were obtained from a different precipitator. Repeat the analysis of 
variance, treating the experiment as a two-way classification, and observe 
what change results in the error mean square. 


6. To emphasize the importance of “blocking,” reanalyze the “whiteness” data 
on page 349 as a one-way classification with the four detergents being the 
different treatments. 

7. If, in a two-way classification, the entire experiment is repeated r times, the 
model becomes ) 


Vijze = WG + By H+ Pe + €ije 
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10. 


11. 


12. 


for i= 1, Z46666F = 2Z,os%, 0, 00d k= 1, 2,....% where thesum of 

the &’s, the sum of the f’s, and the sum of the p’s are equal to zero. The 

€;;, are again values of independent normally distributed random variables 
with zero means and the common variance G2. 

(a) Write down (but do not prove) an identity analogous to the one of 
Theorem 12.2, which subdivides the total sum of squares into compo- 
nents attributable to treatments, blocks, replicates, and error. 

(b) Generalize the computing formulas on page 348, so that they apply to 
a replicated randomized block design. Note that the divisor in each case 
equals the number of observations in the respective totals. 

(c) If the number of degrees of freedom for the replicate sum of squares is 
r — 1, how many degrees of freedom are there for the error sum of 
squares ? 


. The following are the number of defectives produced by four workmen 


operating, in turn, three different machines; in each case, the first figure 
represents the number of defectives produced on a Friday and the second 
figure represents the number of defectives produced on the following 
Monday: 


Workman 


B, 
Aj 37, 43 
Machine A2 a1, 30° 


A3 36, 40 





Use the theory developed in Exercise 7 to analyze the combined figures for 
the two days as a two-way classification with replication. Use the level of 
significance & = 0.05. 


. As was pointed out on page 351, two ways of increasing the size of a two-way 


classification experiment are (a) to double the number of blocks, and (b) to 
replicate the entire experiment. Discuss and compare the gain in degrees of 
freedom for the error sum of squares by the two methods. 


Show that if uw;; = u + 0; + f;, the mean of the y,;; (summed on /) is equal 
to uw + ;, and the mean of the y;; summed on i andj is equal to yw, it follows 


a b 
that x 0; 2 Be =. 


Verify that the computing formulas for SST, SS(7r), SS(B/), and SSE, given 
on page 348, are equivalent to the corresponding terms of the identity of 
Theorem 12.2. 


Verify the results of Duncan’s test for the comparison of the four detergents, 
given on page 354. 
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13. Use the Duncan test with & = 0.05 to compare the effectiveness of the three 
methods of teaching the programming of the computer in Exercise 3 on page 
343. 


14. Use the Duncan test with « = 0.05 to compare the effectiveness of the three 
lubricants in Exercise 6 on page 344. 


15. Use the Duncan test with « = 0.01 to compare the strength of the five linen 
threads in Exercise 1 on page 354. 


16. Does it make sense to use the Duncan test with & = 0.05 to compare the 
results obtained with the four fuels in Exercise 4 on page 355? Why? 


12.5 SOME FURTHER EXPERIMENTAL 
DESIGNS 


The randomized-block design of Section 12.4 is appropriate when one 
extraneous source of variability is to be eliminated in comparing a set of 
sample means. An important feature of this kind of design is its balance, 
achieved by assigning the same number of observations for each treatment 
to each block. (In this connection, see also the comment on page 335, 
where we pointed out that differences due to blocks will not affect the 
means obtained for the different treatments.) The same kind of balance can 
be attained in more complicated kinds of designs, where it is desired to 
eliminate the effect of several extraneous sources of variability. In this 
section we shall introduce two further balanced designs, the Latin-square 
design and the Graeco-Latin-square design, which are used to eliminate 
the effects of two and three extraneous sources of variability, respectively. 


Example. To introduce the Latin-square design, suppose it is desired 
to compare three treatments, A, B, and C, in the presence of two other 
sources of variability. For example, the three treatments may be three 
methods for soldering copper electrical leads, and the two extraneous 
sources of variability may be (1) different operators doing the soldering, 
and (2) the use of different solder fluxes. If three operators and three 
fluxes are to be considered, the experiment might be arranged in the 
following pattern: 


Flux 1 Flux 2 Flux 3 
Operator 1 A B C 
Operator 2 C A B 


Operator 3 B GC A 
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Here each soldering method is applied once by each operator in con- 
junction with each flux, and if there are systematic effects due to dif- 
ferences among operators or differences among fluxes, these effects 
are present equally for each treatment, that is, for each method of 
soldering. 


An experimental arrangement such as the one just described is called a 
Latin square. Ann X n Latin square is a square array of n distinct letters, 
with each letter appearing once and only once in each row and in each 
column. Examples of Latin squares with nm = 4 and nm = 5 are shown in 
Figure 12.2, and larger ones may be obtained from the book by W. G. 
Cochran and G. M. Cox mentioned in the Bibliography. Note that in a 
Latin-square experiment involving nv treatments, it is necessary to include 
n* observations, n for each treatment. 





Figure 12.2. Latin squares. 


As we shall see on page 360, a Latin-square experiment without repli- 
cation provides only (n — 1)(n — 2) degrees of freedom for estimating the 
experimental error. Thus, such experiments are rarely run without replica- 
tion if n is small, that is, without repeating the entire Latin-square pattern 
several times. If there is a total of r replicates, the analysis of the data 
presumes the following model, where y;,j,); is the observation in the ith 
row and the jth column of the /th replicate, and the subscript k, in paren- 
theses, indicates that it pertains to the kth treatment: 





Vig = H+O;,+ By tye + prt Ci i(k)1 
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for 7,.j,% = 1, 2y¢045%) and f= 1, 2... 2575 subject to the restrictions 
that }) a; = 0, 3 fa, 3 y, = 0, and 3 p; = 0. Here yz is the grand 
i=1 j= k=1 I=] 


mean, @, is the ith row effect, B, is the jth column effect, y, is the kth treat- 
ment effect, p, is the effect of the /th replicate, and the €;;;,), are values of 
independent, normally distributed random variables with zero means and 
the common variance a”. Note that by “row effects” and “column effects” 
we mean the effects of the two extraneous variables, and that we are in- 
cluding replicate effects, since, as we shall see, replication can introduce a 
third extraneous variable. Note also that the subscript & is in parentheses 
iN Yij(¢, because, for a given Latin-square design, & is automatically 
determined when i and j are known. 

The main hypothesis we shall want to test is the null hypothesis y, = 0 
for all k, namely, the null hypothesis that there is no difference in the 
effectiveness of the n treatments. However, we can also test whether the 
“cross blocking” of the Latin-square design has been effective; that is, we 
can test the two null hypotheses «, = 0 for all i and f, = 0 for all j 
(against suitable alternatives) to see whether the two extraneous variables 
actually have an effect on the phenomenon under consideration. Further- 
more, we can test the null hypothesis p, = 0 for all / against the alternative 
that the p, are not all equal to zero, and this test for replicate effects may be 
important if the parts of the experiment representing the individual Latin 
squares were performed on different days, by different technicians, at 
different temperatures, and so on. 





SSR = ee 3 Tt? = € (for row effects) 
SSC = ~ pe i ae © (for column effects) 
SS(Rep) = =) T*7,-—C (for replicates) 


SST = > = > Viste — 
SSE = SST — SS(Tr) — SSR — SSC — SS(Rep) 
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The sums of squares required to perform these tests are usually 
obtained by means of the above short-cut formulas, where T, is the 
total of the r-m observations in all of the ith rows, T_, is the total of the 
ren observations in all of the jth columns, T_, is the total of the n* observa- 
tions in the /th replicate, T,,, is the total of all the r-m observations per- 
taining to the Ath treatment, and 7. is the grand total of all the r-n? 
observations: 

Note that again each divisor equals the number of observations in the 
corresponding squared totals. Finally, the results of the analysis are as 
shown in the following analysis of variance table: 











Source of Degrees of Sum of 
variation freedom squares Mean square F 
Treatments n—1 SS(Tr) | MS(Tr) MS(Tr) 
_ SSr) MSE 
n—1 
Rows n—1 SSR MSR MSR 
ni 
Columns n—1 SSC MSC MSC 
nil 
Replicates r—1 SS(Rep) | MS(Rep) MS(Rep) 
_ SS(Rep) MSE 
r—l 





As before, the degrees of freedom for the total sum of squares equals the 
sum of the degrees of freedom for the individual components; thus, the 
degrees of freedom for error are usually found last, by subtraction. 


Example. To illustrate the analysis of a replicated Latin-square experi- 
ment, let us suppose that two replicates of the soldering experiment 
were run, using the following arrangement: 


361 


12.5 Some Further Experimental Designs 


Replicate I Replicate II 
flux flux 
1 2 3 I Z 5 
Operator 1 A os 
Operator 2 C B 
Operator 3 B A 





The results, showing the number of pounds tensile force required to 
separate the soldered leads, were as follows: 


Replicate I Replicate II 





The total for the first treatment (Method A) is 
14.0 + 17.0 + 13.5 + 13.0 + 12.0 + 18.0 = 87.5 
while the totals for the other two treatments (Methods B and C) are 


16.5 + 15.0 + 11.0 + 16.5 + 14.0 + 13.5 = 86.5 
and 
110+ 95+ 12.0 + 10.0 + 12.0 + 11.5 = 66.0 


respectively. Also, the totals for the three rows are 81.0, 79.5, and 79.5, 
those for the three columns are 70.0, 92.0, and 78.0, the totals for the 
two replicates are 119.5 and 120.5, and the grand total is 240. We thus 
obtain 

_ (240) _ 


C= ae = 3200.0 


SS(Tr) = 1{(87.5)? + (86.5)? + (66.0)2] — 3200.0 = 49.1 
SSR = }[(81.0)? + (79.5)? + (79.5)] — 3200.0 = 0.2 
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SSC = 4{(70.0)? + (92.0)? + (78.0)?] — 3200.0 = 41.3 
SS(Rep) = 4(119.5)? + (120.5)?] — 3200.0 = 0.1 

SST = (14.0)? + (16.5)? +... + (11.5)? — 3200.0 = 104.5 

SSE = 104.5 — 49.1 — 41.3 — 0.2 — 0.1 = 13.8 


and the results are as shown in the following analysis of variance table: 


Source of Degrees of Sum of Mean 
variation freedom squares Square 





Since the F ratios for methods and fluxes both exceed 7.56, the value of 
F ,, for 2 and 10 degrees of freedom, the differences due to methods as 
well as fluxes are significant. As can be seen by inspection, the differences 
due to the other two sources of variation (operators and replicates) 
are not significant. To go one step further, the Duncan multiple-range 
test of Section 12.4 gives the following decision pattern at the 0.01 level 
of significance: 


Method C Method B Method A 
Mean 11.0 14.4 14.6 


Thus, we conclude that Method C definitely yields weaker solder bonds 
than Methods A or B. 


The elimination of three extraneous sources of variability can be 
accomplished by means of a design called a Graeco-Latin square. This 
design is a square array of m Latin letters and n Greek letters, with the 
Latin and Greek letters each forming a Latin square; furthermore, each 
Latin letter appears once and only once in conjunction with each Greek 
letter. The following is an example of a 4 x 4 Graeco-Latin square: 
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The construction of Graeco-Latin squares, also called orthogonal Latin 
squares, poses interesting mathematical problems, some of which are 
mentioned in the book by H. B. Mann listed in the Bibliography. 


Example. To give an example where the use of a Graeco-Latin square 
might be appropriate, suppose that in the soldering example an addi- 
tional source of variability is the temperature of the solder. If three 
solder temperatures, denoted by «, 8, and y, are to be used together 
with the three methods, three operators (rows), and three solder fluxes 
(columns), a replicate of a suitable Graeco-Latin-square experiment 
could be laid out as follows: 


Flux 1 Flux 2 Flux 3 


Operator I 
Operator 2 


Operator 3 





Thus, Method A would be used by operator 1 using solder flux 1 and 
temperature «, by operator 2 using solder flux 2 and temperature £, 
and by operator 3 using solder flux 3 and temperature y. Similarly, 
Method B would be used by operator 1 using solder flux 2 and tem- 
perature y, and so forth. 


In a Graeco-Latin square each variable (represented by rows, columns, 
Latin letters, or Greek letters) is “distributed evenly” over the other 
variables. Thus, in comparing the means obtained for one variable, the 
effects of the other variables are all averaged out. The analysis of a Graeco- 
Latin square is similar to that of a Latin square, with the addition of an 
extra source of variability corresponding to the Greek letters. 

There exists a wide variety of experimental designs, other than the ones 
discussed in this chapter, that are suitable for many diverse purposes. 
Among the more widely used designs are the incomplete block designs, 
which are characterized by the feature that each treatment is not repre- 
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sented in each block. If the number of treatments under investigation in 
an experiment is large, it often happens that it is impossible to find homo- 
geneous blocks such that all of the treatments can be accommodated in 
each block. 


Example. For instance, if n paints are to be compared by applying 
each paint to a sheet of metal and then baking the sheets in an oven, it 
may be impossible to put all of the sheets in the oven at one time. 
Consequently, it would be necessary to use an experimental design in 
which k < n treatments (paints) are included in each block (oven run). 
One way of doing this is to assign the treatments to each oven run in 
such a way that each treatment occurs together with each other treat- 
ment in the same number of blocks. For example, form = 4 and k = 
2 we might use the following scheme: 


Oven run Paints 


1 and 2 
3 and 4 
1 and 3 
2 and 4 
1 and 4 
6 2 and 3 


Mm BR WN = 


This kind of design is called a balanced incomplete-block design, and it has 
the important feature that comparisons between any two treatments can be 
made with equal precision. 

Since balanced incomplete-block designs may require too many blocks, 
many other schmes have been developed. Most of these experimental 
designs arose to meet the specific needs of experimenters, notably in the 
field of agriculture. As we have pointed out earlier, much of the language of 
experimental design, including such terms as “treatments,” “blocks,” 
“plots,” etc., has been borrowed from agriculture. Only in recent years 
have the more sophisticated designs been applied to industrial and en- 
gineering experimentation, and, with more widespread application, it is to 
be expected that many new designs will be developed to meet the require- 
ments in these fields. 


EXERCISES 


1. Referring to the problem in which samples of tin plate are to be distributed 
among four laboratories (Section 12.1) suppose that we are concerned with 
systematic differences in tin-coating weight along the direction of rolling as 
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well as across the rolling direction. To eliminate these two sources of vari- 
ability, each of twe sheets of tin plate is divided into 16 parts, representing 
four positions across and four positions along the rolling direction. Then, 
four samples from each sheet are sent to each of the Laboratories A, B, C, and 
D, as shown, and the resulting tin-coating weights are determined. 













































































Replicate I Replicate II 
s A D B Rolling | 2 A C D 
0.20 0,24 0,20 0.27 direction 0.29 0.25 0.18 0.28 
B Cc A D D B A G 
0,28 0.19 0.22 0.28 0.28 0.18 0.21 0.25 
D |B C A C D B A 
0.34 0.23 0.21 0.28 0.28 0.23 0.20 0.28 
A D B C A G D B 
0.32 0.22 0.16 0.27 0.30 0.19 0.24 0.25 


Determine from these data whether the laboratories were obtaining consistent 
results. Also determine whether there are actual tin-coating weight differences 
across and along the rolling direction. (Use the 0.05 level of significance.) 


2. A Latin-square design was used to compare the bond strengths of gold semi- 
conductor lead wires bonded to the lead terminal by five different methods, 
A, B, C, D, and E. The bonds were made by five different operators, and the 
devices were encapsulated using five different plastics, with the following 
results, expressed as pounds force required to break the bond. 


Operators 
O; O2 O3 O4 Os 


Plastics P3 


P4 


Ps 





366 Ch.12 Analysis of Variance 


Analyze these results and apply the Duncan multiple-range test with & = 0.01 
to the mean breaking strengths of the five bonding methods. 


3. A test was made to determine which of three different golf-ball designs, A, B, 
and C will give the greatest distance. Golf balls were driven by three different 
golf pros, P,;, P2, and P3, using three different drivers, D,, D2, and D3. The 
experiment was performed on three different fairways, and the distances from 
the tee to the point where the ball came to rest were measured in yards, as 
follows: 


First fairway Second fairway 





Is any of the golf-ball designs superior to the others with respect to distance? 


4. To study the effectiveness of five different kinds of front-seat passenger 
restraint systems in automobiles, A, B, C, D, and E, the following Graeco- 
Latin-square experiment was performed. The rows represent different auto- 
motive size classes (from sub-compact to full-size), the columns represent 
different barrier impact speeds, and the Greek letters «, 8, y, 6, and € represent 
different impact angles. The experimental results are given in terms of an 
index of forces at critical points on the test dummy that relates to the prob- 
ability of fatal injury. 
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Ao. BB Cy 
0.50 0.21 0.43 





Analyze this experiment. 


5. A clothing manufacturer wishes to determine which of four different needle 
designs is best for his sewing machines. The sources of variability which must 
be eliminated to make this comparison are the actual sewing machine used, 
the operator, and the type of thread. Using the design shown (the rows 
represent the operators, the columns represent the machines, the Latin letters 
represent the needJes, and the Greek letters stand for the types of thread), the 
manufacturer recorded the number of rejected garments at the end of each of 
two weeks, with the following results: 


First week Second week 





Using the 0.05 level of significance, determine whether there is a difference 
among the needles. Also, determine whether there are significant differences 
in the operators, the machines, and the types of thread. 
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6. Give an example of a balanced incomplete block design for which 
(a) n=3 andk =2; (b) n=4andk = 3; 
(c) n=6andk = 4. 
What is the minimum number of times each treatment must be replicated in 
each of these designs? 


12.6 ANALYSIS OF COVARIANCE 


The purpose of the methods of Sections 12.3 and 12.5 was to free the 
experimental error from variability due to identifiable and controllable 
extraneous causes. In this section, we shall introduce a method, called the - 
analysis of covariance, which applies when such extraneous, or concomitant, 
variables cannot be held fixed, but can be measured, nevertheless. This 
would be the case, for example, if we wanted to compare the effectvieness 
of several industrial training programs and the results depended on the 
trainees’ I.Q.’s; if we wanted to compare the durability of several kinds of 
leather soles and the results depended on the weight of the persons wearing 
the shoes: or if we wanted to compare the merits of several cleaning agents 
and the results depended on the original condition of the surfaces cleaned. 

The method by which we analyze data of this kind is a combination of 
the linear regression method of Section 11.1 and the analysis of variance of 
Section 12.2. The underlying model is given by 


Vy = B+, + Oxy + Ey; 





fori=1,2,...,k;7=1,2,...,n. As in the model on page 337, yu is 
the grand mean, «, is the effect of the ith treatment, and the e,, are values 
of independent, normally distributed variables with zero means and the 
common variance o7; as in the model on page 295, 6 is the slope of the 
linear regression equation. 
In the analysis of such data, the values of the concomitant variable, the 
x,;, are eliminated by regression methods, namely, by estimating 6 by the 
method of least squares, and then an analysis of variance 1s performed on 
the adjusted y’s, namely, on the quantities y;; = y,; — bx;;. ; This proce- 
dure is referred to as an analysis of covariance, as it involves a partitioning 
of the total sum of products 


% 


— ya p> On DP Mig x.) 
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in the same way as an ordinary analysis of variance involves the parti- 
tioning of the total sum of squares. In actual practice, the calculations are 
performed as follows: 


1. The total, treatment, and error sums of squares are calculated for 
the x’s by means of the formulas for a one-way classification on 
pages 339 and 340; they will be denoted SST,, SS(Tr),, and SSE,. 

2. The total, treatment, and error sums of squares are calculated for 
the y’s by means of the formulas for a one-way classification on 
pages 339 and 340; they will be denoted SST,, SS(Tr),, and SSE,. 

3. The total, treatment, and error sums of products are calculated by 
means of the formulas 


SPE = SPT — SP(Tr) 


where the correction term, C, is given by 





and where 7, is the total of the x’s for the ith treatment, T,, is the 
total of the y’s for the ith treatment, 7, is the total of all the x’s, 
and T}, is the total of all the y’s. 

4. The total, error, and treatment sums of squares are calculated for 
the adjusted y’s by means of the formulas 


(SPT)? 
SST, 
(SPE)* 
SSE, 


SST, = SST, 


SSE, = SSE, — 


SS(Tn)y = SST, — SSE, 
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The results obtained in these calculations are conveniently summarized by 
means of the following kind of analysis-of-covariance table 


Sum of | Sum of | Sum of | Sum of 
Source of | squares | squares | prod- | squares | Degrees of 
variation for x for y ucts fory’ freedom Mean squares 


MS(Tr )y- 
_ SSCr)y 


k— 1 





Note that each mean square is obtained by dividing the corresponding sum 
of squares by its degrees of freedom. 

Finally, the null hypothesis a, = «7, =... = 4, = 0 is tested against 
the alternative hypothesis that the «; are not all equal to zero on the basis 
of the statistic 


pa MS(Tr)y 


MSE, 





It is rejected at the level of significance « if the value obtained for F 
exceeds F, with k — 1 and nk — k — 1 degrees of freedom. 


Example. To illustrate this kind of analysis, suppose a research worker 
has three different cleaning agents, A,, A,, and A;, and he wishes to 
select the most efficient agent for cleaning a metallic surface. The 
cleanliness of a surface is measured by its reflectivity, expressed in 
arbitrary units as the ratio of the reflectivity observed to that of a 
standard mirror surface. Analysis of covariance must be used because 
the effect of a cleaning agent on reflectivity will depend on the origi- 
nal cleanliness, namely, the original reflectivity, of the surface. The 
research worker obtained the following results: 
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Original reflectivity, x 0.50 





and the totals are 7,, = 2.00, T,, = 4.50, T,, = 3.00, T, = 9.50, i 
4.40, T,, = 2.40, T,, = 3.40, and T, = 10.20. 


For the x’s, the correction term is es = 7.52, and the sums of 
squares are 
SST, = 0.50? + 0.55? + ... + 0.70? — 7.52 = 1.31 


2 
SS(Tr), = sor en 3.00" — 7.52 — 0.79 


SSE, = 1.31 — 0.79 = 0.52 
For the y’s, the correction term is ear = 8.67, and the sums of 
squares are 
SST, = 1.00? + 1.207 + ... + 0.90? — 8.67 = 0.79 
SS(TY), — acl Dose — 8.67 = 0.50 


SSE, = 0.79 — 0.50 = 0.29 


For the sums of products, the correction term is CK) = 8.08, 


and we get 


SPT = (0.50)(1.00) + (0.55)(1.20) +... + (0.70)(0.90) — 8.08 
— —0.80 
SP(Tr) — (2:00)(4.40) + 4.502.40) + (3.00)(3.40) _ g og 


— 0.63 
SPE = —0,80 — (—0.63) = —0.17 
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Finally, for the adjusted y’s we get 


(—0.80)? 


Tay 0.30 


SST, — 0.79 — 


wee O17? 
SSEy = 0.29 — = = 0.23 
SS(Tr)y. = 0.30 — 0.23 = 0.07 


All these results are summarized in the following analysis-of-covariance 
table: 





Sum of | Sum of Sum of | Degrees 
Source of | squares | squares| Sum of | squares of Mean 
variation forx | fory | products | for y’ |freedom| squares 
Treatments 
Error 
Total 
Th 0.035 __ 1 34, and since this d iF p= 4 Ab 
us, f= 0006 — 134 and since this does not exceed Fy ,; = 4. 


for 2 and 8 degrees of freedom, the null hypothesis cannot be rejected. 
In other words, one cannot conclude that any one of the cleaning agents 
is more effective than the others. 


Analysis of covariance methods have not been widely used until recent 
years, due mainly to the rather extensive calculations that are required. Of 
course, with the widespread availability of computers and appropriate 
programs, this is no longer a problem. There are several ways in which the 
analysis of covariance method presented here can be generalized. First, 
there can be more than one concomitant variable; then, the method can 
be applied to more complicated kinds of designs, say, to a randomized 
block design, where the regression coefficient could even assume a dif- 
ferent value for each block. 


EXERCISES 


1. To compare the life expectancy of a transistor under three storage conditions 
and account at the same time for a leakage current (collector to base), a 
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laboratory technician obtained the following results, where the leakage cur- 
rent, x, is in microamperes, and the life times, y, are in hours: 


Storage Storage Storage 
condition I condition 2 condition 3 
x y x y x ¥ 

4.8 9,912 6.4 9,952 8.8 9,596 
ie: 9,383 8.7 9,482 6.2 9,697 
3.5 9,734 7.1 9,435 y 9,700 
6.0 9.551 5.3 9,915 4.9 9,610 
8.3 8,959 4.6 9,492 5.4 10,145 
7.6 9,474 6.0 9,565 5.8 10,191 
5.9 9,179 732 9,704 7.3 9,855 
8.0 9,359 8.8 9,636 8.6 9,682 
4,3 9,580 5.4 9,608 8.8 10,160 
a1 9,245 7.8 9,548 6.0 9,982 


Perform an analysis of covariance, using the level of significance & = 0.05. 
Also, estimate the value of the regression coefficient. 


2. Four different railroad-track cross-section configurations were tested to 
determine which is most resistant to breakage under use conditions. Ten miles 
of each kind of track were laid in each of six locations, and the number of 
cracks and other fracture-related conditions (y) was measured over a two- 
year usage period. To compare these track designs adequately, however, it is 
necessary to correct for extent of usage (x), measured in terms of the average 
number of trains per day that ran over each section of track. Use the following 
experimental results to test (0.01 level of significance) whether the track 
designs were equally resistant to breakage and to estimate the effect of usage 
on breakage resistance. 


Track Track Track Track 
design A design B design C design D 
x y x y x y x y 
10.4 3 16.9 8 17.8 5 19.6 9 
19.3 4 23.6 11 24.4 9 25.4 8 
13.7 4 14.4 7 135 ‘§ 35,5 16 

Tek 0 ie Be 10 20.1 6 16.8 7 
16.3 5 9.1 4 11.0 4 31.2 11 


3. Three different instrument-panel configurations were tested by placing airline 
pilots in flight simulators and testing their reaction time to simulated flight 
emergencies. Each pilot was faced with ten emergency conditions in a ran- 
domized sequence, and the total time required to take corrective action for 
all ten conditions was measured, with the following results: 
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Instrument Instrument Instrument 
panel I panel 2 panel 3 

x y > y x y 

8.1 6.55 12.1 5.74 15.2 6.37 
19.4 6.40 Zl 5.93 8.7 6.97 
11.6 5.93 3.9 6.16 Ted 7.38 
24.9 6.79 3.2 5.68 6.1 6.43 

6.2 7.16 4.6 5.41 11.8 7.59 

3.8 5.64 14.4 6.29 12.1 7.16 
18.4 5.87 16.1 Soo 9.5 7.02 

9.4 6,32 8.5 4.82 2.6 6.85 


In this table, x is the number of years of experience of the pilot, and y is the 
total reaction time in seconds. Perform an analysis of covariance to test 
whether the instrument-panel configurations yield significantly different results 
(% = 0.05). Also, perform a one-way analysis of variance (ignoring the covari- 
ate, x) and determine in that way what effect experience has on the results. 





Factorial 
Experimentation 





13.1 TWO-FACTOR EXPERIMENTS 


In Chapter 12 we were interested mainly in the effects of one variable, 
whose values we referred to as “treatments.” Extraneous variables were 
accommodated by means of blocks, replicates, or the rows and columns 
of Latin squares and more complicated kinds of designs. In this chapter 
we shall be concerned with the individual and joint effects of several vari- 
ables, and combinations of the values, or Jevels, of these variables will 
now play the roles of the different treatments. Extraneous variables, if 
any, will be handled as before. 


Example. To consider a simple two-factor (two-variable) experiment, 
suppose it is desired to determine the effects of flue temperature and 
oven width on the time required to make coke. The experimental con- 
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ditions used are 


Oven width Flue temperature 
(inches) (degrees F) 


1600 
1900 
1600 
1900 
1600 
1900 


NN COO Hh A 


1 
1 


and if several blocks (or replicates) were run, each consisting of these 
six “treatments,” it would be possible to analyze the data as a two-way 
classification and test for significant differences among the six treat- 
ment means. In this instance, however, the experimenter is interested 
in knowing far more than that—he wishes to know whether variations 
in oven width or in flue temperature affect the coking time, and perhaps 
also whether any changes in coking time attributable to variations in 
oven width are the same at different temperatures. 


It is possible to answer questions of this kind if the experimental condi- 
tions, the treatments, consist of appropriate combinations of the /evels (or 
values) of the various factors. The factors in the preceding example are 
oven width and flue temperature; oven width has the three levels 4, 8, and 
12 inches, while flue temperature has the two /evels 1,600 and 1,900 degrees 
Fahrenheit. Note that the six treatments were chosen in such a way that 
each level of oven width is used once in conjunction with each level of 
flue temperature. In general, if two factors A and B are to be investigated 
at a levels and b levels, respectively, and if there are a-b experimental con- 
ditions (treatments) corresponding to all possible combinations of the 
levels of the two factors, the resulting experiment is referred to as a com- 
plete a x b factorial experiment. Note that if one or more of the a-b experi- 
mental conditions is omitted, the experiment can still be analyzed as a 
two-way classification, but it cannot readily be analyzed as a factorial 
experiment. It is customary to omit the word “complete,” so that an 
a x b factorial experiment is understood to contain experimental condi- 
tions corresponding to all possible combinations of the levels of the two 
factors. 

In order to obtain an estimate of the experimental error in a two-factor 
experiment it is necessary to replicate, that is, to repeat the entire set 
of a-b experimental conditions, say, a total of r times, randomizing the 
order of applying the conditions in each replicate. If y,,, is the observation 
in the kth replicate, taken at the ith level of factor A and the jth level of 
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factor B, the model assumed for the analysis of this kind of experiment is 
usually written as 





Vie = H+ 0, + B, + PB); + Pe + € isk 





for i= 1, 2,...,aj=1,2,.+.,6, and k= 1,2,...,%. Here wv is the 
grand mean, «@, is the effect of the ith level of factor A, 8; is the effect of 
the jth level of factor B, («f),, is the interaction, or joint effect, of the ith 
level of factor A and the jth level of factor B, and p, is the effect of the kth 
replicate. As in the models used in Chapter 12 we shall assume that the 
€;;, are values of independent random variables having normal distribu- 
tions with zero means and the common variance o?. Also, analogous to 
the restrictions imposed on the models on pages 337 and 346, we shall 
assume that 


> a; = > B; => (af);; i x (af);; = D3 Pp, =O 


It can be shown that these restrictions will assure unique estimates for the 
parameters yw, «,, B;, (@f),;;, and px. 


Example. To illustrate the model underlying a two-factor experiment, 
let us consider an experiment with two replicates in which factor A 
occurs at two levels, factor B occurs at two levels, and the replication 
effects are zero, that is, p; = p, = O. In view of the restrictions on the 
parameters we also have 


2 = —O,,B,= —B, and (4f)o1 = (4B)12 = —(P)o2. = —COB)11 


and the population means corresponding to the four experimental 
conditions defined by the two levels of factor A and the two levels of 
factor B can be written as 


Mii = Myi2 =ME OH + B+ OB): 
Mia = Mia = ht & — B, —- OB) 
Moi. = Mar = H—-O% + By — OB) 
Moa = Mo = W— O% — Br + OB) 


Substituting for w,;; = 4,;, the mean of all observations obtained for 
the ith level of factor A and the jth level of factor B, we get four simul- 
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taneous linear equations which can be solved for the parameters y, «,, 
B:, and (af),1, (see Exercise 8 on page 393). 

To continue our illustration, let us now suppose that w = 10. If all 
of the other effects equalled zero, each of the y,;, would equal 10, and 
the response surface would be the horizontal plane shown in Figure 
13.1(a). If we now add an effect of factor A, witha, = —4, the response 
surface becomes the tilted plane shown in Figure 13.1(b), and if we add 
to this an effect of factor B, with B, = 5, we get the plane shown in 
Figure 13.1(c). Note that, so far, the effects of factors A and B are: 





Figure 13.1. Factorial effects. 


“= 
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additive, that is, the change in the mean for either factor in going from 
level 1 to level 2 does not depend on the level of the other factor, and 
the response surface is a plane. If we now include an interaction, with 
(#8),, = —2, the plane becomes twisted as shown in Figure 13.1(d), 
the effects are no longer additive, and the response surface is no longer 
a plane. Note, also, that if the replication effects had not equalled zero, 
we would have obtained a different surface for each replicate; for each 
replicate the surface of Figure 13.1(d) would have been shifted an 
appropriate number of units up or down. 


The analysis of ana x b factorial experiment is based on the following 
breakdown of the total sum of squares. First, we subdivide SST into 
components attributed to treatments, replicates (or blocks), and error, 
by means of the identity 


a b a »b 
»> Dy in — I)? = - >» Vi, =F 
i=1 j=1 k=1 i=1 j=1 
+ ab (.2 5.) 
a b r 
Tp 2a Bs Vite Vite Pe Daa) 


Except for notation, this identity is equivalent to that of Theorem 12.2. 
The total sum of squares, on the left-hand side of the identity, has abr — 1 
degrees of freedom. The terms on the right are, respectively, the treatment 
sum of squares having ab — 1 degrees of freedom, the replicate (block) 
sum of squares having r — 1 degrees of freedom, and the error sum of 
squares having (ab — 1)(r— 1) degrees of freedom. (Note that the 
various degrees of freedom are the same as those of the analysis of variance 
table on page 349 if we substitute ab for a and r for B.) 

So far there is nothing new about the analysis of the data; it is the 
analysis of a two-way classification, but the distinguishing feature of a fac- 
torial experiment is that the treatment sum of squares can be further subdivid- 
ed into components corresponding to the various factorial effects. Thus, for a 
two-factor experiment we have the following subdivision, or breakdown, 
of the treatment sum of squares: 


a b a _ 
r pS »s Vy, — ¥...)? = rb a ..=F,,) 7 J5.=F,2° 


The first term on the right measures the variability of the means corre- 
sponding to the different levels of factor A, and we refer to it as the factor 
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A sum of squares, SSA. Similarly, the second term is the sum of squares 
for factor B, SSB, and the third term is the interaction sum of squares 
SS(AB), which measures the variability of the means y,,, that is not 
attributable to the individual (or separate) effects of factors A and B. 
The ab — 1 degrees of freedom for treatments are, accordingly, subdivided 
into a — 1 degrees of freedom for the effect of factor A, b — 1 for the 


effect of factor B, and 
ab —1—(a—1)—(-1) =(a—- DYO-— 1) 
degrees of freedom for interaction. 
Example. To illustrate the analysis of a two-factor experiment, let us 


refer again to the coking experiment described on page 375, and let us 
suppose that three replicates yielded the following coking times (in 





hours): 
Factor A Factor B 
Oven width Flue temp. Rep. 1 Rep. 2 Rep. 3 Total 
4 1600 3.5 3.0 2.7 9.2 
4 1900 Zi 23 2.4 6.9 
8 1600 TA 6.9 fs 21.5 
8 1900 yy! 4.6 6.8 16.6 
12 1600 10.8 10.6 11.0 32.4 
12 1900 7.6 7.1 7.3 22.0 
Total 36.4 34.5 37.7 108.6 


Following the procedure used in analyzing a two-way classification, we 
first compute the correction term 


C= Gor — 655.22 


Then, the total sum of squares is given by 
SST = (3.5)? + (2.2)? + ... + (7.3)? — 655.22 = 149.38 


and the treatment and replicate (instead of blocks) sums of squares 
are given by 
SS(Tr) = 4[(9.2)* + (6.9)? + ... + (22.0)?] — 655.22 = 146.05 
SSR = 3[(36.4)? + (34.5)? + (37.7)?] — 655.22 = 0.86 
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Finally, by subtraction, we obtain 


SSE = 149.38 — 146.05 — 0.86 = 2.47 


Subdivision of the treatment sum of squares into components for 
factors A and B, and for interaction, can be facilitated by constructing 
the following kind of two-way table, where the entries are the totals 
in the right-hand column of the table giving the original data: 


Factor B 

Flue temperature 

1600 1900 
4 16.1 

Factor A 

Oven width g aoe 
12 54.4 
108.6 





Using formulas analogous to the ones with which we computed the 
sums of squares for various effects in Chapter 12, we now have for 
the two main effects 


SSA = pT -C 


= 4[(16.1)? + (38.1)? + (54.4)?] — 655.22 = 123.14 
Ch een Sis a ee 6 
Aer j=1 
= $[(63.1)? + (45.5)?] — 655.22 = 17.21 


and for the interaction 


SS(AB) = SS(Tr) — SSA — SSB 
= 146.05 — 123.14 — 17.21 = 5.70 


Finally, dividing the various sums of squares by their degrees of free- 
dom, and dividing the appropriate mean squares by the error mean 
square, we obtain the results shown in the following analysis of variance 
table: 
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Source of Degrees of Sum of 
variation freedom squares 


Replication 2 0.86 


Main effects: 
A 
B 


Interaction 


Error 


Total 





The F test for replications is not significant at the 0.05 or 0.01 levels o 
significance, but the other three F tests are significant at the 0.01 leve 
of significance. Hence, we reject the null hypothesis that the M,; are all 
equal to zero, the null hypothesis that the 8, are all equal to zero, and 
the null hypothesis that the (@),, are all equal to zero. These results are 
illustrated in Figure 13.2, showing the trend of mean coking times for 
changing oven width at each of the flue temperatures. It is apparent 
from this figure that the increase in coking time for changing oven width 
is greater at the lower flue temperature. In view of this interaction, great 
care must be exercised in stating the results of this experiment. For 
instance, it would be very misleading to state merely that the effect of 
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increasing flue temperature from 1,600 to 1,900 degrees Fahrenheit is 
to lower the coking time by ot — ae — 1.96 hours. In fact, the 
coking time is lowered on the average by as little as 0.77 hours when 
the oven width is 4 inches, and by as much as 3.47 hours when the 


oven width is 12 inches. 


13.2. MULTIFACTOR EXPERIMENTS 


Much industrial research and experimentation is conducted to dis- 
cover the individual and joint effects of several factors on variables which 
are most relevant to phenomena under investigation. The experimental 
designs most often used are of the simple randomized-block or two-way 
classification type, but the distinguishing feature of most of them is the 
factorial arrangement of the treatments, or experimental conditions. As 
we observed in the preceding section, r sets of data pertaining to a-b 
experimental conditions can be analyzed as a factorial experiment in r 
replicates if the experimental conditions represent all possible combina- 
tions of the levels of two factors A and B. In this section, we shall extend 
the discussion to factorial experiments involving more than two factors, 
that is, to experiments where the experimental conditions represent all 
possible combinations of the levels of three or more factors. 


Example. To illustrate the analysis of a multifactor experiment, let 
us consider the following situation. A warm sulfuric pickling bath is 
used to remove oxides from the surface of a metal prior to plating, 
and it is desired to determine what factors in addition to the concentra- 
tion of the sulfuric acid might affect the electrical conductivity of the 
bath. As it is felt that the salt concentration as well as the bath tem- 
perature might also affect the electrical conductivity, an experiment is 
planned to determine the individual and joint effects of these three 
variables on the electrical conductivity of the bath. In order to cover 
the ranges of concentrations and temperatures normally encountered, 
it is decided to use the following levels of the three factors: 





Factor Level I Level 2 Level 3 Level 4 
A. Acid concentration 0 6 12 18 
(percent) 
B. Salt concentration 0 10 20 
(percent) 
C. Bath temperature 80 100 
(degrees F) 


The resulting factorial experiment requires 4-3-2 = 24 experimental 
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conditions in each replicate, where each experimental condition is a 
pickling bath made up according to specifications. The order in which 
these pickling baths are made up should be random. Let us suppose 
that two replicates of the experiment have actually been completed, that 
is, the electrical conductivities of the various pickling baths have been 
measured, and that the results are as shown in the following table: 


RESULTS OF ACID-BATH EXPERIMENT 


Level of factor Conductivity (mhos/cm?3) 
A B G Rep. 1 Rep. 2 Total 
1 1 1 0.99 0.93 1.92 
1 1 2 1.15 0.99 2.14 
1 2 1 0.97 0.91 1.88 
1 2 2 0.87 0.86 1.73 
1 3 1 0.95 0.86 1.81 
1 3 2 0.91 0.85 1.76 
2 1 1 1.00 1.17 ZAF 
2 1 2 1.12 LAs 2.25 
2 Z 1 0.99 1.04 2.03 
2 Z 2 0.96 0.98 1.94 
2 3 1 0.97 0.95 1.92 
2 3 2 0.94 0.99 1.93 
3 1 1 1.24 122 2.46 
3 1 2 1.12 1.15 Ziai 
3 Z 1 Lats 0.95 2.10 
3 2 2 1.11 0.95 2.06 
3 3 1 1.03 1.01 2.04 
3 3. 2 1.12 0.96 2.08 
4 1 1 1.24 1.20 2.44 
4 1 2 1.32 1.24 2.56 
4 2 1 1.14 1.10 2.24 
4 Z Zz 1.20 1.19 2.39 
4 3 1 1.02 1.01 2.03 
4 3 2 1.02 1.00 202 
Total 25.33 24.64 50.17 


The model we shall assume for the analysis of this experiment (or any 
similar three-factor experiment) is an immediate extension of the one used 
in Section 13.1. If y,,,; is the conductivity measurement obtained at the 
ith level of acid concentration, the jth level of salt concentration, the kth 
level of bath temperature, in the /th replicate, we write 





Vint = UO; + B; + Vere (af); ; + (O))iz + (BY) jx 
COBY): 5% =“ Pr pe 
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106 PS de Qe ees Bed SS Wi 2g Dok = 12,240 96, Sd = Laat: 
We also assume that the sums of the main effects (a's, 6’s, and y’s) as well 
as the sum of the replication effects are equal to zero, that the sums of the 
two-way interaction effects summed on either subscript equal zero for any 
value of the other subscript, and that the sum of the three-way interaction 
effects summed on any one of the subscripts is zero for any values of the 
other two subscripts. As before, the €,,,; are assumed to be values of inde- 
pendent random variables having zero means and the common variance o”. 


Continuation of example. We begin the analysis of the data by treating 
the experiment as a two-way classification with a-b-c treatments and 
r replicates (blocks), and using the short-cut formulas on page 348, we 
obtain 


C= one — 52.4381 


SST = (0.99)? + (1.15)? + ... + (1.00)? — 52.4381 = 0.6624 
SS(Tr) = 4{(1.92)? + (2.14)? + ... + (2.02)?] — 52.4381 = 0.5712 

SSR = y[(25.53)? + (24.64)?] — 52.4381 = 0.0165 

SSE = 0.6624 — 0.5712 — 0.0165 = 0.0747 


The degrees of freedom for these sums of squares are, respectively, 47, 
23, 1, and 23. 

Next, we shall want to subdivide the treatment sum of squares into 
the three main effect sums of squares SSA, SSB, SSC, the three two-way 
interaction sums of squares, SS(AB), SS(AC), and SS(BC), and the 
three-way interaction sum of squares SS(ABC). To facilitate the cal- 
culation of these sums of squares we first construct the following three 
tables analogous to the one on page 381: 


Cc 
1 2 s 1 2 
1 11.24 1 11.24 
2 12.24 2 12.24 
A A 
3 13.01 3 13.01 
4 13.68 4 13.68 





18.21 16.37 15.59 50.17 25.04 25.13 50.17 
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25.04 


eae 





18.21 16.37 15.59 50.17 


The entries of these tables are the totals of all measurements obtained 
at the respective levels of the two variables. Note the self-checking 
feature of these tables; the same marginal totals appear several times, 
thus providing a rapid and effective check on the calculations. 

To calculate SSA, SSB, and SS(AB) we refer to the first of the 
above tables and an identity analogous to the one on page 379. As 
a matter of fact, the calculations parallel those for calculating SSA, 
SSB, and SS(AB) in the two-factor experiment. To take the place of 
the treatment sum of squares we first calculate 


a b 
red 2 Vu. =P) = is yh Ti; ATE 
= 4[(4.06)? + (4.42)? + ... + (4.05)?] 
— 52.4381 
=2 19301 


and we then obtain 
lig _ 
SSA = har 2 T? —C 


= A f(11.24)? + ... + (13.68)? — 52.4381 
= 0.2750 


ed SpE ax 
SSB = 7 74. —C€ 


= 7[(18.21)? + (16.37)? + (15.59)7] — 52.4381 
== 2262 
and 
SS(AB) = 0.5301 — 0.2750 — 0.2262 


= 0.0289 


Performing the same calculations for the second of the tables on page 
385 we obtain, similarly, 


SSC = 0.0002 and SS(AC) = 0.0085 
and the analysis of the third table yields 
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SS(BC) = 0.0042 


For the three-way interaction sum of squares we finally obtain by sub- 


traction 
SS(ABC) 
= §S(Tr) — SSA — SSB — SSC — SS(AB) — SS(AC) 
— SS(BC) 
= 0.5712 — 0.2750 — 0.2262 — 0.0002 — 0.0289 — 0.0085 
— 0.0042 
= 0.0282 


Note that the degrees of freedom for each main effect is one less than 
the number of levels of the corresponding factor. The degrees of free- 
dom for each interaction is the product of the degrees of freedom for 
those factors appearing in the interaction. Thus, the degrees of freedom 
for the three main effects are 3, 2, and 1 in this example, while the 
degrees of freedom for the two-way interactions are 6, 3, and 2, and the 
degrees of freedom for the three-way interaction is 6. 

The following table shows the complete analysis of variance for 
the acid-bath experiment: 


Source of Degrees of Sum of Mean 
variation freedom squares square F 
Replicates 1 0.0165 0.0165 5.16 
Main effects: 
A 3 0.2750 0.0917 28.66 
B Z 0.2262 0.1131 35.34 
C 1 0.0002 0.0002 =<] 
Two-factor 
interactions: 
AB 6 0.0289 0.0048 1.50 
AC 3 0.0085 0.0028 ae | 
BC 2 0.0042 0.0021 < | 


ES | Sf SS | 


Three-factor 


interaction: 
ABC 6 0.0282 0.0047 1.47 
Error 23 0.0747 0.0032 
Total 47 0.6624 
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Obtaining the appropriate values of F.); and Fo, from Table 6, we 
find that the test for replicates is significant at the 0.05 level (perhaps 
the two replicates were performed under different atmospheric condi- 
tions or the thermometer used to measure bath temperatures went out 
of calibration, etc.), the tests for the factor A and factor B main effects 
are significant at the 0.01 level, while none of the other F’s are signifi- 
cant at either level. We conclude from this analysis that variations in 
acid concentration and salt concentration affect the electrical conduc- 
tivity, variations in bath temperature do not, and that there are no 
‘interactions. To go one step further, we might investigate the magni- 
tudes of the effects by studying graphs of means like those shown in 
Figures 13.3 and 13.4. Here we find that the conductivity increases as 





Figure 13.3. Effect of acid concentration. 





Figure 13.4. Effect of salt concentration. 
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acid is added and decreases as salt is added; using the methods of 
Chapter 11 we might even fit lines, curves, or surfaces to describe the 
response surface relating conductivity to the variables under consid- 
eration. 


The general computational procedure for a multifactor experiment is 
similar to the method illustrated here fora4 x 3 x 2 factorial experiment. 
We first analyze the data as a two-way classification (or whatever other 
design is being used) and then we analyze the treatment sum of squares 
into the components attributed to the various main effects and interac- 
tions. In general, the sum of squares for each main effect is found by 
adding the squares of the totals corresponding to the different levels of 
that factor, dividing by the number of observations comprising each of 
these totals, and then subtracting the correction term. The sum of squares 
for any interaction is found by adding the squares of all totals obtained by 
summing over those subscripts pertaining to factors not involved in the 
interaction, dividing by the number of observations comprising each of 
these totals, and then subtracting the correction term and all sums of 
squares corresponding to main effects and fewer-factor interactions 
involving the factors contained in that interaction. 


EXERCISES 


1. To determine optimum conditions for a plating bath, the effects of sulfone 
concentration and bath temperature on the reflectivity of the plated metal are 
studied in a 2 x 5 factorial experiment. The results of three replicates are as 
follows: 


Concentration Temperature Reflectivity 
(grams/liter) (degrees F) Rep. 1 Rep. 2 Rep. 3 
ecient eee 
5 75 35 39 36 
5 100 31 37 36 
> 125 30 31 33 
= 150 28 20 23 
> 195 19 18 Zz 
10 75 38 46 41 
10 100 36 44 39 
10 125 39 32 38 
10 150 35 47 40 
10 175 30 38 31 


Analyze these results and determine the bath condition or conditions that 
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nN 


produce the highest reflectivity. Also construct a 0.95 confidence interval for 
the reflectivity of the plating bath corresponding to these optimum conditions. 


. A spoilage retarding ingredient is added in brewing beer. To determine the 


extent to which the taste of the beer is affected by the amount of this ingredient 
added to each batch, and how such taste changes might depend on the age of 
the beer, a 3 x 4 factorial experiment in two replications was designed. The 
taste of the beer was rated on a scale of 0, 1, 2, or 3 (3 being the most desirable) 
by a panel of trained experts, who reported the following mean ratings: 


Amount of ingredient Aging period Mean ratings 
(grams per batch) (weeks) Rep. 1 Rep. 2 
Z 2 2.1 1.6 
Z 4 2.6 1.9 
2 6 2.9 2.4 
3 2 1.4 Lg 
3 4 1.9 2i2 
3 6 23 Dad 
4 2 0.5 0.9 
4 4 LZ 0.8 
4 6 1.7 1.4 
5 2 1.0 1.6 
5 4 22 13 
2 6 2.3 2.1 


(a) Analyze the data first as a two-way classification with 12 treatments and 
2 blocks (replicates). 

(b) Calculate the sums of squares corresponding to the main effects and 
interaction, and present the results in an analysis of variance table. 

(c) Interpret the results of the experiment. 


. Suppose that in the experiment described on page 349 it is desired to deter- 


mine also whether there is an interaction between the detergents and the 
washing machines, that is, whether one detergent might perform better in 
Machine 1, another might perform better in Machine 2, and so forth. Com- 
bining the data on page 349 with the replicate of the experiment given below, 
test for a significant interaction and discuss the results. 


Machine 1 Machine2 Machine 3 


Detergent A 
Detergent B 
Detergent C 


Detergent D 
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4. Referring to Exercise 4 on page 355, it can be argued that the rocket-launcher 
experiment was poorly designed, as it was not replicated, and, thus, it is impos- 
sible to test whether there is an interaction between launchers and rocket 
fuels. Suppose a second replicate were performed, with the following results: 


Fuel I Fuel II Fuel II Fuel IV 
Launcher X 66.8 51,1 40.1 49.2 
Launcher Y 47.7 38.2 50.6 64.3 


Combining the results of both replicates, perform an appropriate analysis of 
variance, and test for the presence of an interaction. 


5. The following tables give the weights (in grams) of food ingested by two 
different strains of rats after having been deprived of food for the stated num- 
ber of hours and then having been given the stated dosage of a certain drug: 


Replicate 1 Replicate 2 
Strain A Strain B Strain A Strain B 

I hour 

Dosage 
6.2 mele 4 hours 
8 hours 
I hour 

Dosage 
Bid capil 4 hours 
& hours 
I hour 

Dosage 
0.8 molke 4 hours 
8 hours 





Perform an appropriate analysis of variance and interpret the results. 


6. A study was conducted to measure the effect of three different meat tenderizers 
on the weight loss of steaks having the same initial (precooked) weights. The 
effects of cooking temperatures and cooking times also were measured by 
performing a 3 x 2 x 2 factorial experiment in 3 replicates. The results are 
as follows: 
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Cooking time Cooking Weight loss (ounces) 
Tenderizer (minutes) temperature (°F) Rep.1 Rep.2 Rep. 3 


A 20 350 15 1.3 1.4 
A 20 400 1.6 1.4 Lo 
A 30 350 Lt 1.8 1.7 
A 30 400 1.8 19 2.0 
B 20 350 12 2.1 2.0 
B 20 400 2.2 2.4 Zio 
B 30 350 2.6 Pe 2.4 
B 30 400 2.6 ee | 2.5 
Cc 20 350 0.9 0.8 0.8 
Cc 20 400 1.1 1.0 0.9 
C 30 350 0.8 0.9 1.0 
Cc 30 400 hae 1.0 1.1 


(a) Analyze this experiment first as a two-way classification with 12 treat- 
ments and 3 replicates (blocks). 

(b) Complete the analysis by computing the sums of squares corresponding 
to the various main effects and interactions. 

(c) Present the results in an analysis of variance table and interpret the 
experiment. 


7. To study the effects of ingot location (A), slab position (B), specimen prepara- 
tion (C), and twisting temperature (D) on the number of turns required to 
break a steel] specimen by twisting, the following observations were recorded: 


No. of turns 


A B C D Rep. 1 Rep. 2 
top 1 turn 2,100°F 24 22 
top 1 turn 2,200 25 28 
top 1 turn 2,300 41 39 
top 1 grind 2,100 18 18 
top 1 grind 2,200 33 27 
top 1 grind 2,300 35 41 
top 2 turn 2,100 Le 19 
top 2 turn 2,200 26 31 
top 2 turn 2,300 JF 43 
top Z grind 2,100 23 7 
top 2 grind 2,200 30 26 
top 2 grind 2,300 34 30 
mid 1 turn 2,100 26 19 
mid 1 turn 2,200 30 31 
mid 1 turn 2,300 39 42 
mid 1 grind 2,100 19 19 
mid 1 grind 2,200 31 31 
mid 1 grind 2,300 26 35 
mid Z turn 2,100 30 26 
mid 2 turn 2,200 31 34 
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No. of turns 
A B c D Rep. 1 Rep. 2 

mid 2 turn 2,300 39 42 
mid 2 grind —_—-.2,,100 22 20 
mid 2 grind 2,200 32 26 
mid 2 grind 2,300 38 22, 
bot 1 turn 2,100 18 21 
bot 1 turn 2,200 35 32 
bot 1 turn 2,300 34 37 
bot 1 grind 2,100 21 19 
bot 1 grind 2,200 20 29 
bot 1 grind 2,300 44 31 
bot Z turn 2,100 23 22 
bot z turn 2,200 31 26 
bot 2 turn 2,300 38 41 
bot 2 grind 2,100 18 19 
bot 2 grind 2,200 31 24 
bot 2 grind 2,300 35 41 


Analyze this experiment. 


8. Solve the four equations on page 377 for 4, 1, B;, and (#f),,; in terms of the 
population means j;;; corresponding to the four experimental conditions in 
the first replicate. Note that these equations serve as a guide for estimating 
the parameters in terms of the sample means corresponding to the various 
experimental conditions. 


13.3. 2” FACTORIAL EXPERIMENTS 


There are several reasons why factorial experiments are often per- 
formed with each factor taken at only two levels. Primarily, the number of 
experimental conditions in a factorial experiment increases multiplicatively 
with the number of levels of each factor; thus, if many factors are to be 
investigated simultaneously, it may be economically impossible to include 
more than two levels of each factor. Another important reason for treating 
2" factorial experiments separately is that there exist computational short- 
cuts which apply only to this case. In fact, the remainder of this section 
will be devoted to these computational short-cuts, while other advantages, 
such as the ease of confounding higher-order interactions and the adapt- 
ability of 2” factorials to experiments involving fractional replication, will 
be discussed in succeeding sections. 

Before we introduce some of the special notation used in connection 
with 2” factorial experiments, let us point out that such experiments do 
have some drawbacks. Since each factor is measured only at two levels, 
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it is impossible to judge whether the effects produced by variations in a 
factor are linear or, perhaps, parabolic or exponential. For this reason 2” 
factorial experiments are often used in “screening experiments,” which 
are followed up by experiments involving fewer factors (ordinarily those 
found to be “significant” individually or jointly in the screening experi- 
ment) taken at more than two levels. 

In the analysis of a 2” factorial experiment it is convenient to denote the 
two levels of each factor by 0 and 1 (instead of 1 and 2). Thus, the models 
used for the analysis of this kind of experiment differ from those of Section 
13.2 only inasmuch as we now have i = 0, 1 instead of i= 1,2,..., a, 
j =O, 1 instead of 7 = 1, 2,...,5, and so forth. For instance, for a 23 
factorial experiment the model on page 384 becomes 


Vijet = M+O + By + ve + OB); + OM) + (BY) x 
= (OB Y); jx + Pi Eye 


fori=0,1,7=0,1,4 =0,1,and/=1,2,...,7. The €,,,; are defined 
as before, and the parameters are now subject to the restrictions 7, = —Qp, 


f, = —Bos 1 = Pos (OB)i0 = (OB)o: = —(4f)11 = —(aB)o0; yas Bn 


S| Pp; = 0. Note that besides the parameters for replicates we need only one 
l=1 


parameter of each kind: that is, besides the parameters for replicates, we 
can express the entire model in terms of the parameters 12, &, Bos Yo. (@B)oo; 
(GY )oor (By)oo, and (@PY)oo0. 

A 2” factorial experiment requires 2” experimental conditions; since 
their number can be fairly large, it will be convenient to represent the 
experimental conditions by means of a special notation and list them in a 
so-called standard order. The notation consists of representing each 
experimental condition by the product of lower-case letters corresponding 
to the factors which are taken at level 1, called the “higher level.” If a 
lower-case letter corresponding to a factor is missing, this means that the 
factor is taken at level 0, called the “lower level.” Thus, in a three-factor 
experiment, ac represents the experimental condition where factors A and 
C are taken at the higher level and factor B is taken at the lower level, 
c represents the experimental condition where factor C is taken at the 
higher level and factors A and B are taken at the lower level, and so forth. 
The symbol “1” is used to denote the experimental condition in which all 
factors are taken at the lower level. 

Although the experimental conditions are applied in a random order 
during the experiment itself, for the purpose of analyzing the results it is 
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convenient to arrange them in a so-called standard order. For n = 2, this 
order is 1, a, b, ab, and for n = 3 it is the order shown in the following 
table: 


Experimental Level of factor 
condition A B C 
1 0 0 0 
a 1 0 0 
b 0 1 0 
ab 1 1 0 
Cc 0 0 1 
ac 1 0 1 
be 0 1 1 
abc 1 1 1 


Note that the symbols for the first four experimental conditions are like 
those for a two-factor experiment, and that the second four are obtained 
by multiplying each of the first four symbols by c. Similarly, the arrange- 
ment for n = 4 on page 398 is obtained by first listing the eight symbols 
for a three-factor experiment and then repeating the set with each symbol 
multiplied by d. 

Throughout this and the preceding chapter we referred to the total of 
all observations corresponding to a given experimental condition as a 
treatment total, and we represented these totals by means of symbols 
such as T,, T;,, and so forth. Having introduced a special notation for 
the experimental conditions in a 2” factorial experiment, we extend this 
notation by letting (1), (a), (b), (ab), (c),..., be the treatment totals 


corresponding to experimental conditions 1, a,b, ab,c,.... Thus, in a 
three-factor experiment 

(1) = 21 Yoo (a) = 2 Vr007 

(bc) = bu Voit (abe) = dy visu 


The computational short-cuts referred to on page 393 consist, essen- 
tially, of expressing estimates of the various main effects and interactions 
as well as the corresponding sums of squares, in terms of Jinear combinations 
of the treatment totals. To illustrate what we mean, let us consider the 
quantity 


—(1) + (a) — (6) + (ab) — (c) + (ae) — (ce) + (abc) 


which is a linear combination, with coefficients +1 and —1, of the treat- 
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ment totals corresponding to the eight experimental conditions. Referring 
to the model equation on page 394 and making use of the relationships 
among the parameters (but leaving all details to the reader in Exercises 
6, 7, and 8 on pages 404 and 405) it can be shown that 


—(1) + (@ — (6) + (ab) — (©) + (ac) — (6c) + (abe) = —8ro9 + €4 


where €, is a corresponding linear combination of sums of the €;;,). From 
Theorem 7.1 on page 215 it follows that €, is a value of a random variable 
whose distribution has zero mean; as a matter of fact, it can be shown 
that €, is a value of a random variable having a normal distribution with 
zero mean and the variance 8ro”. Referring to the above linear combina- 
tion as the effect total [A] for factor A, we find that —[A]/8r provides an 
estimate fo a, the main effect for factor A, and it can be shown that 
[A]?/8r actually equals SSA, the sum of squares for the main effect of 
factor A. 
Similarly analyzing the linear combination 


(1) + (@ — (6) — (ab) — (©) — (ae) + (6c) + (abc) 


the reader will be asked to show in Exercise 8 on page 405 that it equals 
8r(By)oo + €ac, Where €p¢ 1S a corresponding linear combination of sums of 
the €;,;,;. Referring to this linear combination of treatment totals as the 
effect total [BC] for the two-way interaction of factors B and C, we find that 
[BC]/8r provides an estimate of (By) 9, the effect of the BC interaction, and 
it can also be shown that [BC]?/8r equals SS(BC), the sum of squares for 
the BC interaction. Proceeding in this fashion, we can present linear com- 
binations of the treatment totals which yield estimates of the various other 
main effects and interactions, and whose squares, divided by 8r, yield the 
corresponding sums of squares. These linear combinations, or effect totals, 
can easily be obtained with the use of the following table of signs: 


Effect 
(1) (a) (b) (ab) (c) (ac) (bc) (abc) totals 
1 1 1 1 1 1 1 1 [Z] 
—] 1 —] 1 —] 1 —] 1 [A] 
—] —] 1 1 —] —] 1 1 [B] 
1 —] —1 1 1 —1 —] 1 [AB] 
of of ~2i inal 1 1 1 1 [C] 
1 —] 1 —] —] 1 —] 1 [AC] 
1 1 —] —1 —1 —] 1 1 [BC] 
—1 1 1 —] 1 —1 —1 1 [ABC] 
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The entries of this table are the coefficients of the linear combinations 
of the treatment totals for the various main effects and interactions. As 
an aid for constructing similar tables for n = 4, n = 5, etc., note that for 
each main effect there is a “-++1” when the factor is at the higher level and 
a “—1” when the factor is at the lower level. The signs for an interaction 
effect are obtained by multiplying the corresponding coefficients of al/ 
factors contained in the interaction. Thus, for [4B] we multiply each 
sign for [A] by the corresponding sign for [B], getting 


(—I(-)D MAY) DM MG) -1I(-) @(-) (—)d) MQ 
OT 


] =] =i J I —|] —|] J 


Note also that in the above table [J] stands for the grand total of all the 
observations, so that [/]*/8r gives the correction term for calculating SST, 
SSE, SSR, and SS(7r). 

Although we have illustrated the above short-cut method for obtaining 
the various main-effect and interaction sums of squares with reference to a 
2° factorial experiment, the only difference in a 2” factorial experiment with 
n > 31s that we require a more extensive table of signs and that the respec- 
tive sums of squares are obtained by dividing the squares of the effect totals 
by r+2”. 


Example. To illustrate this technique and introduce a further simplifi- 
cation, let us consider the following 2* factorial experiment, designed 
to determine the effects of certain variables on the reliability of a rotary 
stepping switch. The factors studied were as follows: 





Factor Low level High level 
A. Lubrication dry lubricated 
B. Dust protection unprotected enclosed in 
dust cover 
C,. Spark suppression no yes 
D. Current 0 0.5 amp 


Each switch was operated continuously until a malfunction occurred, 
and the number of hours of operation was recorded. The whole experi- 
ment was performed twice, with the following results: 
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Experimental Hours of operation 

condition Rep. 1 Rep, 2 Total 
1 828 797 1,625 

a 997 948 1,945 

b 123 776 1,511 

ab 807 1,003 1,810 

C 994 949 1,943 

ac 1,069 1,094 2,163 

be 989 1,215 2,204 
abc 889 1,010 1,899 

d 593 813 1,406 

ad 773 1,026 1,799 

bd 740 922 1,662 
abd 936 1,138 2,074 

cd 748 970 1,718 
acd 1,202 1,182 2,384 
bcd 1,103 966 2,069 
abcd 985 1,154 2,139 
Totals 14,388 15,963 30,351 


Analyzing these data first as a two-way classification with 16 treat- 
ments and 2 replications (blocks), we obtain 


c — 30,351)" _ 23 786,975 
oy) 
SST = (828)? + (997)? + ... + (1,154)? — 28,786,975 


= 744,876 
SS(Tr) = 4{(1,625)? + (1,945)? + ... + (2,139)] — 28,786,975 
— 547,288 
SSR = +,[(14,388)? + (15,963)?] — 28,786,975 
— 77,520 


SSE = 744,876 — 547,288 — 77,520 = 120,068 


In order to subdivide the treatment sum of squares into SSA, SSB,..., 
and SS(ABCD), we could construct a table of signs like the one on 
page 396, calculate the effect totals, and then divide the squares of 
the effect totals by r-2” = 2-24 = 32. For the A factor main effect we 
would thus obtain 
[A] = —1,625 + 1,945 — 1,511 + 1,810 — 1,943 + 2,163 — 2,204 
+ 1,899 — 1,406 + 1,799 — 1,662 + 2,074 — 1,718 + 2,384 
— 2,069 + 2,139 
= 2,075 
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and 
(2,075)? 
32 


These calculations are quite tedious, but they can be simplified con- 
siderably by using a further short-cut, called the method of Yates. This 
method of calculating the effect totals is illustrated in the table shown 
below. The experimental conditions and the corresponding totals are 
listed in standard order. In the column marked (1), the upper half is 
obtained by adding successive pairs of treatment totals, and the lower 
half is obtained by subtracting successive pairs. Thus, in column (1) 
we obtained 


SSA = = 134,551 


1,625 + 1,945 = 3,570 
1,511 + 1,810 = 3,321 


ee3coeeeet fe eee eee © & Be 


2,069 + 2,139 = 4,208 
1,945 — 1,625 = 320 
1,610 1511 = 299 


Experi- | Treat- 
mental ment Identifi- Sum of 
condition | totals (J) (2) (3) (4) cation Squares 
1 1,625 | 3,570 | 6,891 | 15,100 30,351 | [Z] 28,786,975 
a 1,945 | 3,321 | 8,209 | 15,251 2,075 | [A] 134,551 
b 1,511 | 4,106 | 6,941 534 385 | [B] 4,632 
ab 1,810 | 4,103 | 8,310 | 1,541 | —1,123 | [4B] 39,410 
é 1,943 | 3,205 619 | —252 2,687 | [C] 225,624 
ac 2,163 | 3,736 —85 637 —773 | [AC] 18,673 
be 2,204 | 4,102 805 | —546 —179 | [BC] 1,001 
abe 1,899 | 4,208 736 | —577 | —1,119 | [ABC] 39,130 
d 1,406 320 | —249 1,318 151 | [D] 713 
ad 1,799 299 —3 1,369 1,007 | [AD] 31,689 
bd 1,662 220 531 | —704 889 | [BD] 24,698 
abd 2,074 | —305 106 — 69 —31 | [ABD] 30 
cd 1,718 393 —21 246 51 | [CD] 81 
acd 2,384 412 | —525 | —425 635 | [ACD] 12,601 
bcd 2,069 666 19 | —504 —671 | [BCD] 14,070 
abcd 2,139 70 | —596 | —615 —111 | [ABCD] 385 


Note that the first total in each pair is subtracted from the second. 
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Column (2) is then obtained by performing the identical operations on 
the entries of column (1), and columns (3) and (4) are obtained in the 
same manner from the entries in columns (2) and (3), respectively. 
Column (4), and in general column (n), gives the effect totals in stand- 
ard order, as shown. Each sum of squares is then obtained as before, 
by squaring the corresponding effect total and then dividing the result 
by 762% = 202? = 32, 

Dividing the sums of squares by their degrees of freedom to obtain 
the mean squares, and dividing the various mean squares by the error 
mean square, we get the following analysis of variance table for the 
2‘ factorial experiment: 













Sum of Mean 
squares square 


Degrees of 
freedom 


Source of 
variation 











me a | | 
SS _E—=——E=—=>EeaSQQaa=@@aS=Saaaaeeeee Ss OOO eee 
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744,876 
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Since F'),; = 4.54 and F.,, = 8.68 for 1 and 15 degrees of freedom, we 
find that the replication effects as well as the effects of lubrication and 
spark suppression are significant at the 0.01 level, and that there are 
Significant interactions at the 0.05 level between lubrication, dust 
protection, and spark suppression. The reader will be asked to inter- 
pret these results and estimate the magnitude of some of the effects 
in Exercise 9 on page 405. 


EXERCISES 


. To determine the effect on taste of three different factors in manufacturing 


soft-drink cans, an experiment was performed where the taste of one soft 
drink was rated by a judge on a scale from 1 to 10. The results are as fol- 
lows: 





A B c Ratings 
Lubricant Heat Resin Rep. 1 Rep. 2 
fresh unheated A 6 8 
fresh unheated B 8 7 
fresh heated A 9 9 
fresh heated B 1 2 
aged unheated A 6 7 
aged unheated B 6 8 
aged heated A 9 8 
aged heated B Z 3 


(a) Analyze the results first as a two-way classification with 7 degrees of 
freedom for treatments and 1 degree of freedom for blocks (replicates). 

(b) Use an appropriate table of signs to calculate the effect totals [A], [B], 
[C], [AB], [AC], [BC], [ABC]. ; 

(c) Using the results obtained in (b) find the sums of squares corresponding 
to the main effects and interactions, and check their total against the 
treatment sum of squares obtained in (a). 

(d) Arrange the data with treatment combinations in standard order, and 
use the Yates method to find the effect totals. Compare with the results 
obtained in (b). 

(e) Construct an analysis of variance table and analyze the experiment. 


. An experiment was conducted to determine the effects of certain alloying 


elements on the ductility of a metal, and the following results were 
obtained: 
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Breaking strength (ft-lb) 


Nickel Carbon Manganese Rep. 1 Rep. 2 Rep. 3 
0.0% 0.3% 0.5% 36.7 39.6 38.2 
0.0 0.3 1.0 47.5 43.5 45.9 
0.0 0.6 0.5 40.6 36.8 36.0 
0.0 0.6 1.0 41.1 45.8 46.4 
4.0 0.3 0.5 37.8 32.7 31.6 
4.0 0.3 1.0 34.2 S12 36.5 
4.0 0.6 0.5 39.5 41.7 39.1 
4.0 0.6 1.0 46.4 43.7 49.4 


Perform an appropriate analysis of variance and interpret the results. 


3. A screening experiment was conducted to determine what factors are influ- 
ential in controlling the final phosphorus content of steel produced in a 
converter. The levels of the factors studied and the experimental results are 
contained in the following table: 


E D C B A 
Pouring Lime Original Original Final phosphorus 
temp.(°F) ratio Oxygen phosphorus manganese’ Rep. 1 Rep. 2 

2,400 3 nage O13 1%, 0.003 % 0.001% 
2,400 3 5 0.15 3 0.004 0.009 
2,400 3 5 0.30 1 0.002 0.008 
2,400 3 5 0.30 3 0.015 0.007 
2,400 3 15 0.15 1 0.002 0.005 
2,400 3 Ls 0.15 3 0.011 0.006 
2,400 3 15 0.30 1 0.004 0.001 
2,400 3 15 0.30 3 0.002 0.004 
2,400 4 5 0.15 1 0.000 0.003 
2,400 4 5 0.15 3 0.008 0.002 
2,400 4 5 0.30 1 0.003 0.007 
2,400 4 5 0.30 3 0.005 0.012 
2,400 4 15 0.15 1 0.010 0.006 
2,400 4 15 0.15 3 0.006 0.001 
2,400 4 15 0.30 1 0.006 0.014 
2,400 4 15 0.30 3 0.011 0.015 
2,600 3 5 0.15 1 0.003 0.007 
2,600 3 5 0.15 3 0.007 0.004 
2,600 3 5 0.30 1 0.011 0.005 
2,600 3 5 0.30 K, 0.010 0.017 
2,600 3 15 0.15 1 0.004 0.008 
2,600 3 15 0.15 3 0.019 0.013 
2,600 3 15 0.30 1 0.004 0.008 
2,600 3 15 0.30 3 0.017 0.023 
2,600 4 5 0.15 1 0.007 0.004 
2,600 4 5 0.15 3 0.015 0.009 
2,600 4 5 0.30 1 0.004 0.011 
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E D & B A 
Pouring Lime Original Original Final phosphorus 
temp.(°F) ratio Oxygen phosphorus manganese Rep. I Rep. 2 

2,600 4 5 0.30 3 0.010 0.006 
2,600 4 15 0.15 1 0.017 0.011 
2,600 4 15 0.15 3 0.005 0.010 
2,600 4 15 0.30 1 0.014 0.009 
2,600 4 15 0.30 3 0.016 


Analyze the results of this experiment. 


4. An experiment was conducted to determine the effects of the following fac- 


tors on the gain of a semiconductor device: 


Factor Level 0 Level J 
A. Location of assembly Laboratory Production line 
B. Partial pressure of 10-15 10-4 
controlling material 
C. Relative humidity 1% 30% 
D. Aging time 72 hours 144 hours 
The results were as follows: 
Experimental Gain 
condition Rep. 1 Rep. 2 

1 39.0 43.2 

a 31.8 43.7 

b 47.0 51.4 

ab 40.9 40.3 

Cc 43.8 40.5 

ac 29.3 52.9 

be 34.8 48.2 

abc 45.6 58.2 

d 40.1 41.9 

ad 42.0 40.5 

bd 54.9 53.0 

abd 39.9 40.2 

cd 43.1 40.2 

acd 30.1 39.9 

bcd 35.6 53.7 

abcd 41.4 49.5 


0.011 


Perform an appropriate analysis of variance and interpret the results. 


5. A study was performed to examine the effect of five different factors on the 
time required for odorants in natural gas to reach the surface once a leak 
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has occurred. The results of the experiment were reported as follows: 





A B C D E Time 
Odorant Soil Flow Tempera- (hours) 
Odorant concentration moisture rate ture Rep. 1 Rep. 2 
mg/ml ml/min 

A 4 oo 20 50°F 20 20 
A 4 8 20 70 17 18 
A 4 8 80 50 20 19 
A 4 8 80 70 18 19 
A 4 16 20 50 12 13 
A 4 16 20 70 14 16 
A 4 16 80 50 15 14 
A 4 16 80 70 17 16 
A 16 8 20 50 10 9 
A 16 8 20 70 9 8 
A 16 8 80 50 8 9 
A 16 8 80 70 7 8 
A 16 16 20 50 7 6 
A 16 16 20 70 6 4 
A 16 16 80 50 5 6 
A 16 16 80 70 4 5 
B 4 8 20 50 28 29 
B 4 8 20 70 27 26 
B 4 8 80 50 25 27 
B 4 8 80 70 28 28 
B 4 16 20 50 24 2 
B 4 16 20 70 26 25 
B 4 16 80 50 23 24 
B 4 16 80 70 24 25 
B 16 8 20 50 26 24 
B 16 8 20 70 23 25 
B 16 8 80 50 ZT 26 
B 16 8 80 70 23 26 
B 16 16 20 50 19 18 
B 16 16 20 70 16 17 
B 16 16 80 50 20 19 
B 16 16 80 70 17 15 


Analyze the results of this experiment. 


6. Writing the treatment total (a) as the sum of the corresponding observations 
Y100; and substituting for these observations the expressions given by the 
model equation on page 394, it can be shown that 


(a) =r[e +O, + Bo +¥0 + OP)10 + (HY)10 + (BY)oo + (BY)100] 
ca > E100! 
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10. 


11. 


Making use of the restrictions imposed on the parameters, rewrite this ex- 
pression for (a) in terms of the parameters 4, %, Bo, Yo, (&B)o0, (47)o0, 


(By)oo, and (&BY)o00. 


- Duplicating the work of Exercise 6, express (1), (6), (ab), (c), (ac), (bc), and 


(abc) in terms of the parameters , %o, Bo, Yo, (&PB)o0, (&P)o0, (BY)o0, and 
(0 BY)oo0- 


. Using the results of Exercises 6 and 7, verify the expressions for [A] and 


[BC] obtained on page 396. Also express € 4 in terms of the quantities €; ;,;. 


. Interpret the results of the analysis of variance given by the table on page 


400, and estimate the magnitude of the significant effects. 


A computational check on the sums of squares obtained for the various main 
effects and interactions is that their sum must equal the treatment sum of 
squares obtained by analyzing the data first as a two-way classification. 
Perform this check on the sums of squares given in the table on page 400. 


If it is desired to find an expression for an effect total without constructing 
a complete table of signs, one can use the following method, illustrated by 
finding [ABC] for a 24 factorial experiment. We take the expression (a + 1) 
(6 + 1)(c + 1)(d + 1) witha “+” if the corresponding letter does not appear 
in the symbol for the main effect or interaction for which we want to calcu- 
late an effect total, and a “—” if the corresponding letter does appear. Thus, 
for finding [ABC] we write 


(a — 1)(6 — 1)(e — 1)(d + 1) = abcd + abc — abd — acd — bcd — ab 
—ac+ad—bec+bd+cdta+tb 
+ce—d-—l1 


and after arranging the terms in standard order and adding parentheses we 
finally obtain 


[ABC] = —(1) + @ + ©) — (4b) + (c) — (ac) — (bc) + (abe) — (a) 
+ (ad) + (bd) — (abd) + (cd) — (acd) — (bcd) + (abcd) 
(a) Use this method to express [B], [AC], and [ABC] in terms of the treat- 
ment totals in a 23 factorial experiment. 


(b) Use this method to express [AC] and [BCD] in terms of the treatment 
totals in a 2* factorial experiment. 


13.4 CONFOUNDING IN A 2” FACTORIAL 


EXPERIMENT 


In some experiments it is impossible to run all the required experi- 


mental conditions in one block. For example, if a 2? factorial experiment 
involves eight combinations of paint pigments that are to be applied to 
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a surface and baked in an oven which can accommodate only four speci- 
mens, it becomes necessary to divide the eight treatments into two blocks 
(oven runs) in each replicate. As we have pointed out earlier, if the block 
size is too small to accommodate all treatments, this requires special, 
so-called incomplete block designs. 

When the experimental conditions are distributed over several blocks, 
one or more of the effects may become confounded (inseparable) with 
possible block effects, that is, between-block differences. For example, 
if in the 23 factorial experiment referred to in the preceding paragraph, 
experimental conditions a, ab, ac, and abc are included in one oven run 
(Block 1) and experimental conditions 1, 8, c, and be are included in a 
second oven run (Block 2), then the “block effect,” the difference between 
the two block totals, is given by 


[(a) + (ab) + (ac) + (abe)] — [) + ©) + (©) + Ge) 


Referring to the table of signs on page 396, we observe that this quantity 
is, in fact, the effect total [A], so that the estimate of the main effect of 
factor A is confounded with blocks. Note that all other factorial effects 
remain unconfounded; for each other effect total there are two +1 coeffi- 
cients and two —1 coefficients in each block, so that the block effects 
cancel out. This kind of argument can also be used to decide what experi- 
mental conditions to put into each block to confound a given main effect 
or interaction. For instance, had we wanted to confound the ABC inter- 
action with blocks in the above example, we could have put experimental 
conditions a, b, c, and abc, whose totals have +1 coefficients in [ABC], 
into one block, and experimental conditions 1, ab, ac, and be, whose totals 
have —1 coefficients, into another. 

In general, confounding in a 2” factorial experiment can be much more 
complicated than in the example just given. To avoid serious difficulties, 
we shall require that the number of blocks used is a power of 2, say 2”. It 
turns out that the price paid for running a 2” factorial experiment in 2? 
blocks is that a total of 2? — 1 effects are confounded with blocks. To 
make it clear just which effects are confounded, and to indicate a method 
that can be used to confound only certain effects and no others, it is helpful 
to define the term “generalized interaction” as follows: the generalized 
interaction of two effects is the “product” of these effects, with like letters 
cancelled. Thus, the generalized interaction of AB and CD is ABCD, and 
the generalized interaction of ABC and BCD is ABCBCD, or AD. To 
confound a 2” factorial experiment in 2? blocks, the following method can 
be used: one selects any p effects for confounding, making sure that none 
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is the generalized interaction of any of the others selected. Then, it can be 
shown that a further 2? — (p +- 1) effects are automatically confounded 
with blocks; together with the p effects originally chosen, this gives a 
total of 2? — 1 confounded effects in the experiment. The additional con- 
founded effects are, in fact, the generalized interactions of the p effects 
originally chosen. 


Example. To illustrate the construction of a confounded design, let us 
divide a 2* factorial experiment into four blocks so that desired effects 
are confounded with blocks. In actual practice one ordinarily con- 
founds only the higher-order interactions (in the hope that they are 
nonexistent anyhow). Since we have decided upon four blocks, we 
have 2? = 4and p = 2, and we shall arbitrarily select two higher-order 
interactions for confounding. If we were to select ABCD and BCD, 
then their generalized interaction, A, would also be confounded. Thus, 
to avoid confounding any main effects, and to confound as few two- 
factor interactions as possible, we shall select ABD and ACD, noting 
that the BC interaction is also confounded. (Observe that it is impos- 
sible to avoid confounding at least one main effect or two-factor in- 
teraction in this experiment.) 

In order to assign the 16 experimental conditions to four blocks, 
we first distribute them into two blocks, so that the ABD interaction is 
confounded with blocks. Referring to an appropriate table of signs, we 
put all treatments whose totals have a “+1” in the row for [ABD] into 
one block, all those whose totals have a “—1” into a second block, 
and we get the following blocks: 


First block: a b ac bec dad abd cd abcd 
Second block: 1 ab c abe ad bd_ acd bcd 


Note that each experimental condition in the first block has an odd 
number of letters in common with ABD, while each experimental 
condition in the second block has an even number of letters in common 
with ABD. This odd-even rule provides an alternate way of distribut- 
ing experimental conditions among two blocks to confound a given 
effect, and it has the advantage that it does not require the construction 
of a complete table of signs. 

So far, we have confounded the ABD interaction by dividing the 16 
experimental conditions into two blocks; now we shall confound the 
ACD interaction by dividing each of these blocks into two blocks of 
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four conditions each. Using the odd-even rule just described (or a 
table of signs), we obtain the following four blocks: 


Block 1: a be ad abcd 
Block 2: b ac abd cd 
Block 3: ab c bd acd 
Block 4: 1 abe ad_ bed 


By comparing these blocks with a table of signs, or, equivalently, by 
applying the odd-even rule, the reader will be asked to verify in Exercise 
7 on page 416 that the BC interaction is also confounded with blocks, 
while all other effects are left unconfounded. 


The analysis of a confounded 2” factorial experiment is similar to that 
of an unconfounded experiment, with the exception that the sums of 
squares for the confounded effects are not computed, and we compute a 
block sum of squares as if the experiment consisted of br blocks rather 
than b blocks in each of r replicates. 


Continuation of example. Referring to our example of a 2* factorial 
experiment with the ABD, ACD, and BC interactions confounded, 
and using two replicates, we have the following dummy analysis of 
variance table. 


Degrees of 
Source of variation freedom 
Blocks 7 
Main effects 4 
Unconfounded two-factor 5 
interactions 
Unconfounded three-factor 2 
interactions 
Four-factor interaction 1 
Intrablock error 12 


Total 31 
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The sum of squares for blocks is obtained, as usual, by adding the squares 
of the eight block totals, dividing the result by 4 (the number of observa- 
tions in each block), and subtracting the correction term. The total sum 
of squares and the sums of squares for the unconfounded factorial effects 
are obtained in the usual way, and the sum of squares for the intra- 
block error, a measure of the variability within blocks, is obtained by 
subtraction. 


Example. To illustrate the analysis of a confounded 2” factorial experi- 
ment, let us suppose that each replicate of the stepping-switch experi- 
ment described in the preceding section was actually run in four blocks, 
because only four mountings were available for the 16 switches. (The 
order of running the blocks is assumed to have been randomized within 
each replicate, and the assignment of switches is assumed to have been 
randomized within each block.) Assuming also that the ABD, ACD, 
and BC interactions were confounded with the blocks as shown on 
pages 407 and 408, we obtain the following block totals from the data 
on page 398: 


Block 1 Block 2 Block 3 Block 4 


Replicate 1 3,564 3,488 3,743 3,993 


Replicate 2 3,978 4,056 3,799 





Thus, the sum of squares for blocks is given by 


SS(BI) = Coo + GAY ts + G79)" — 28,786,975 


= 101,240 


where the correction factor is the same as in the analysis on page 
398. 

Copying the total sum of squares and the sums of squares for the 
various unconfounded effects from the table on page 400, we obtain 
the analysis of variance table for the confounded factorial experiment 
shown on page 410. In this analysis, the A and C main effects are again 
significant at the 0.01 level, but none of the other main effects or inter- 
actions is significant. 
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Degrees of Sum of Mean 
Source of variation freedom squares square F 
Blocks 7 101,240 14,463 1.58 
Main effects: 
A 1 134,551 134,551 14.68 
B 1 4,632 4,632 < | 
C 1 225,624 225,624 24.62 
D 1 713 713 se | 
Unconfounded 
two-factor 
interactions: 
AB 1 39,410 39,410 4.30 
AC 1 18,673 18,673 2.04 
AD 1 31,689 31,689 3.46 
BD 1 24,698 24,698 2.69 
CD 1 81 81 < i 
Unconfounded 
three-factor 
interactions: 
ABC 1 39,130 39,130 4.27 
BCD 1 14,070 14,070 1.54 
Four-factor 
interaction: 
ABCD 1 385 385 =<] 
Intrablock error 12 109,980 9,165 
Total 31 744,876 





If there is replication, some of the lost information about the con- 
founded effects can be recovered by a further breakdown of the above 
blocks sum of squares. This analysis consists of dividing the sum of 
squares for blocks into a component for each of the confounded effects, 
a component for replications, and a residual component called the 
“interblock error,” which is a measure of the variability between 
blocks. Copying the sum of squares for replicates as well as those for 
the BC, ABD, and ACD interactions from the analysis of variance 
table on page 400, and copying the blocks sum of squares from the 
intrablock analysis of variance table above we obtain the following 
interblock analysis of variance: 
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Degrees of Sum of Mean 
Source of variation | freedom squares square 


Replicates 


Confounded 
effects: 
BC 
ABD 
ACD 


Interblock error 


Total (blocks) 101,240 





Note that the interblock error is obtained by subtraction and that the 
F ratios are obtained by dividing the mean squares for the confounded 
effects and the mean square for replicates by the mean square for the 
interblock error. Only the F test for replicates is significant (at the 0.05 
level). The small number of degrees of freedom for the interblock 
error implies that the sensitivity of these significance tests is very poor; 
in fact, it rarely pays to make this kind of interblock analysis unless 
the number of replications is relatively large. 


13.5. FRACTIONAL REPLICATION 


In studies involving complex production lines, chemical processes such 
as may be encountered in the petroleum, plastics, or metals industries, 
physical-chemical processes such as may be encountered in the electronics 
or space-technology industries, and in many other engineering studies, 
the experimenter is often faced with a large and bewildering array of inter- 
related variables. The principles of factorial experimentation treated so 
far in this chapter help him to “sort out” these variables, to discover 
which of them have the greatest influence on the process under considera- 
tion, and what important interrelationships may exist. 

There are, however, some serious limitations to the simultaneous study 
of a large number of factors. Even if each factor is assigned only two levels, 
one replicate of a six-factor experiment requires 64 observations; there are 
128 observations in one replicate of a seven-factor experiment, and 1,024 
observations in one replicate of a ten-factor experiment. The economic 
and practical limitations of these large numbers make it necessary to seek 
out ways in which the size of factorial experiments can be kept within 
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manageable bounds. Of course, it must be emphasized that there is no 
substitute for careful preliminary planning which, coupled with engineer- 
ing insight, can result in the elimination of many needless factors. 

In spite of the most careful preliminary planning, however, it is often 
difficult to avoid having to include as many as six or ten (or more) factors 
in a single experiment. One way to reduce the size of such an experiment 
would be to break it up into several parts, each part involving the deliberate 
variation of one factor while all others are held fixed. This would have the 
undesirable consequence that we could not study any of the interactions. 
Even if we were to include half the factors in one part and the other half 
in another, such as replacing a ten-factor experiment (requiring 1,024 
observations) by two five-factor experiments (each requiring 32 observa- 
tions), any interaction between factors in the first part and factors in the 
second part would be irretrievably lost. It is possible to overcome some of 
these difficulties by observing that most of the time we are not interested 
in all the interactions. For instance, it is possible to perform only a frac- 
tion of a 2” factorial and yet obtain most of the desired information, say, 
about the main effects and two-factor interactions (but not the higher inter- 
actions). 

The principles involved in fractional replication, that is, in performing 
only a fraction of a complete 2” factorial experiment, are similar to those 
used in confounding. To obtain a half-replicate one selects only one of 
the two blocks into which the experimental conditions have been divided 
by confounding one effect; to obtain a quarter-replicate one selects only 
one of the four blocks into which the experimental conditions have been 
divided by confounding two effects, and so forth. In contrast to a con- 
founded experiment as discussed in Section 13.4, we find that in a fractional 
replicate the effects are confounded, not with blocks, but with each other. 


Example. To illustrate, suppose that only the experimental conditions 
a, b, c, and abc are included in a half-replicate of a 23 factorial experi- 
ment. (This is one block of a 2? factorial experiment with the ABC 
interaction confounded.) Considering the table of signs on page 396 
with all columns except those corresponding to a, b, c, and abc crossed 
out, we find that the effect total for factor A is now given by 


[A] = (a) — (6) — (¢) + (abc) 


If we write (a) = >} yio0n (6) = > Yo102---, and substitute for the 
1=1 1=1 


Vise, the expressions given by the model equation on page 394, we 
obtain 


[A] = —4r[o%o — (By)oo] + € 
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where € is the value of a random variable having zero mean (see 
Exercise 16 on page 418). We thus find that [A] measures the main 
effect of factor A as well as the BC interaction, so that these two effects 
have become inseparable, or confounded. Note also that in the reduced 
table of signs (having columns only corresponding to a, b, c, and abc) 
the signs for [A] and [BC] are identical, and hence [A] = [BC]. In 
Exercise 17 on page 418, the reader will be asked to show that for the 
given fractional replicate the main effect for factor B is, similarly, 
confounded, or aliased, with the AC interaction, while the main effect 
for factor C is confounded, or aliased, with the AB interaction. The 
ABC interaction cannot be estimated. 


With careful design, it is generally possible to confound all main effects 
and two-factor interactions only with interactions of higher order. This is 
illustrated in the following example, where we shall construct a half- 
replicate of a 25 factorial. 


Example. First, we select an effect (usually a higher-order interaction) 
to split the experiment into two blocks, as in confounding. The effect 
chosen is called the defining contrast, and it cannot be estimated at all 
by the fractional replicate. Every other effect is aliased with another 
effect, namely, its generalized interaction (see page 406) with the defining 
contrast. Thus, if the defining contrast is ABCDE, the main effect for 
factor A has the four-factor interaction BCDE as its alias, BC and ADE 
are an alias pair, and so forth. As we have seen, only the combined 
effect (the sum or difference of the aliased effects) can be estimated in 
the experiment. However, if it can be assumed that there are no higher- 
order interactions, one can attribute the effect of the alias pair BC 
and ADE entirely to the two-factor interaction BC, one can attribute 
the effect of the alias pair A and BCDE entirely to the main effect of 
factor A, and so forth. A complete listing of the aliases in a half-repli- 
cate of a 2° factorial experiment having the defining contrast ABCDE 
is as follows: 


Aand BCDE, Band ACDE, Cand ABDE, Dand ABCE 
Eand ABCD, ABand CDE, ACand BDE, ADand BCE 
AE and BCD, BCand ADE, BDand ACE, BEand ACD 
CDand ABE, CEand ABD, DEand ABC 


Note that no main effect or two-factor interaction is aliased with 
another main effect or two-factor interaction. 

The sixteen experimental conditions to be included in the half- 
replicate are given by those in either of the two blocks obtained by 
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confounding the defining contrast. Choosing “evens” in the odd-even 
rule, namely, those conditions which have an even number of letters 
in common with the defining contrast ABCDE, we obtain the follow- 
ing half-replicate: 

1 ad ae de 

ab bd _ be abde 

ac cd ce acde 


bc abcd abce bcde 


To go one step further, let us illustrate how to construct a quarter- 
replicate of the given 2° factorial. We shall do this by dividing the 
above half-replicate in half, confounding the three-factor interaction 
ABC. Again using “evens,” we get the following eight experimental 
conditions: 


1 de ab abde 
ac acde bc _ bcde 


Since DE, the generalized interaction of the two confounded effects, 
is also confounded, we now have the three defining contrasts ABCDE, 
ABC, and DE. None of these effects can be estimated in the quarter- 
replicate, and each other effect is aliased with its three generalized 
interactions with the three defining contrasts. The complete aliasing 
is as follows: 


Alias sets 

A, BCDE, BC, ADE 
B, ACDE, AC, BDE 
C, ABDE, AB, CDE 
D, ABCE, ABCD, E 
AD, BCE, BCD, AE 
BD, ACE, ACD, BE 
CD, ABE, ABD, CE 


We have given this quarter-replicate merely as an illustration; it would 
hardly seem useful in actual practice because of the hopeless aliasing of 
main effects and two-factor interactions. Nevertheless, quarter-replicates 
of six- and seven-factor experiments (and even eighth-replicates of seven- 
and eight-factor experiments) can often provide much useful information. 

The analysis of a fractional factorial is practically the same as that of a 
fully replicated factorial experiment. Given the fraction 1/2? of a 2” fac- 
torial, there are 2”-” experimental conditions, and the method of Yates 
can be used as if the experiment were a 2”? factorial. Some care must be 


415 13.5 Fractional Replication 


exercised to arrange the experimental conditions in a modified “standard 
order” as indicated in Exercise 14 on page 417. 

When dealing with fractional factorials, there is the problem of obtain- 
ing an estimate of the experimental error. For example, in the half-replicate 
of a 25 factorial described on page 413, the breakdown of the total sum of 
squares having 15 degrees of freedom yields sums of squares for main 
effects (5 degrees of freedom), sums of squares for two-factor interactions 
(10 degrees of freedom), but no component (0 degrees of freedom) for the 
experimental error. In a situation like this, and in all other cases where the 
number of degrees of freedom for error is small, it is best to include a 
limited amount of replication. This may be accomplished by randomly 
selecting several experimental conditions, and making additional observa- 
tions corresponding to these conditions. Furthermore, if it can be assumed 
that there are no higher-order interactions, the total of the sums of squares 
corresponding to higher-order interactions which are not aliased with main 
effects or lower-order interactions can be attributed to “error,” and used in 
the denominator of the F test (see Exercise 15 below). In this way, use 
can be made of the “hidden replication” inherent in most large factorial 
experiments. 

To summarize, fractional replication is useful whenever the number of 
factors to be included in an experiment is large and it is not economically 
feasible to include all possible experimental conditions. The reduction in 
size (and, therefore, in cost) of a fractionally replicated experiment is 
partially offset by the loss of information caused by aliasing, and by the 
difficulties inherent in estimating the experimental error. A more detailed 
discussion of fractional replication, including fractional replicates of 3” 
experiments and a variety of other designs, can be found in the book by 
W. G. Cochran and G. M. Cox listed in the Bibliography. 


EXERCISES 


1. A 25 factorial experiment, having the factors A, B, C, D, and E, is to be run 
in several blocks. Show which treatments are assigned to each block if 
(a) there are to be two blocks, with the ACDE interaction confounded; 
(b) there are to be four blocks, with the BDE and ABCE interactions con- 
founded. What other factorial effect or effects are also confounded? 


2. List the experimental conditions included in each block if a 24 factorial 
experiment is to be confounded 
(a) in 2 blocks on ABCD; 
(b) in 4 blocks on ABC and BCD. 
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Js 


What is the largest number of blocks in which one can perform a 2° factorial 
experiment without confounding any main effects? 


. List the experimental conditions included in each block if a 2° factorial 


experiment is to be confounded in 8 blocks on ABDE, BCDF, and ABC. 
What other factorial effects are confounded ? 


. Suppose that in Exercise 1 on page 401 the judge rates the soft drinks in sets 


of four, with a rest period in between, and that the experiment was actually 
performed so that each replicate consisted of two blocks with ABC con- 
founded. Perform an intrablock analysis of the data. 


. Four new drugs are to be investigated to determine their effectiveness as 


tranquilizers, individually and in combination with each other. Each patient 
was given regular doses of one of the sixteen tranquilizers formed from these 
drugs (including a placebo corresponding to the 0 level for each drug) and, 
after a two-week period, the effect of these tranquilizers on the emotional 
stability of each patient was judged (on a scale from 1 to 5) by five psychia- 
trists. To keep the staff work load within reasonable bounds, two hospitals 
were used in this experiment, and eight patients were selected by the staff of 
each hospital for each trial (replicate); thus, the experiment involved two 
replicates of two blocks each, with the ABCD interaction confounded. The 
results of the two 2-week trials were as follows: 


FIRST HOSPITAL SECOND HOSPITAL 
Treatment Mean rating Treatment Mean rating 
combination Trial 1 Trial 2 combination Trial ] Trial 2 
i! 2.0 2.6 a Zuid 2.6 
ab 3.8 3.4 b 3.6 2.0 
ac 4.2 4.8 Cc 2.4 1.8 
be 4.8 4.0 abc 4.0 3.8 
ad 1.8 2.4 d 1.8 Duke 
bd 3.4 3.8 abd 1.6 2.0 
cd 4.6 Ze acd 3.6 2.4 
abcd 4.2 4.6 bcd 3.4 3.8 


If a high rating indicates satisfactory progress, which drug combination (or 
combinations) seems to be the most promising? Perform an intrablock 
analysis of variance to test for significant effects. 


. Verify that in the example on page 407 all treatment totals having +1 asa 


coefficient in [BC] are in Blocks 1 and 4, that all treatment totals having —1 
as a coefficient in [BC] are in Blocks 2 and 3, and, hence, that the BC inter- 
action is confounded with blocks. 


. Suppose that in Exercise 3 on page 402 only 8 specimens could be tested in 


any one shift, and that the experiment was actually performed so that each 
replicate consists of four blocks with ABC, ADE, and BCDE confounded. 
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10. 


11. 


12. 


13. 


14. 


15. 


(a) If for each factor levels 0 and 1 are, respectively, the lower and higher 
values, construct a table showing which experimental conditions go into 
each of the four blocks. 

(b) Perform an intrablock analysis of the experiment. 


. Show that the “odd-even” rule for assigning treatments to blocks, given on 


page 407, is equivalent to the method described in Exercise 11 on page 405. 


Referring to Exercise 1, suppose that in each case the block containing treat- 
ment combination 1 was chosen as a fractional replicate of a 25 factorial. 

(a) Show the alias pairs in the resulting half-replicate. 

(b) Show the alias sets in the resulting quarter-replicate. 

(In practice, random selection should be used to choose the block to be 
included in a fractional replicate.) 


Referring to Exercise 2, list the alias sets if an experiment consists of 

(a) a half-replicate of a 2* factorial experiment with ABCD confounded, 

(b) a quarter-replicate of a 2+ factorial experiment with ABC and BCD con- 
founded. 


Construct a half-replicate of a 2° factorial having the defining contrast 
ABCDEF. List the 32 treatments in the block containing experimental 
condition 1, and show the alias pairs. 


Design an experiment consisting of a quarter-replicate of a 27 factorial with 
ABCDE, ABCFG, and DEFG confounded, if the experimental condition 
with each factor at the 0 level is to be included. Also exhibit all alias sets. 


Referring to Exercise 12, we can define a modified standard order for the 32 
treatment combinations in the half-replicate as follows. First, we list the 32 
treatment combinations corresponding to the five factors A, B, C, D, and 

E in standard order. Then we append the letter f to 16 of these treatment 

combinations, so that the list contains the same ones as the block chosen for 

the half-replicate. 

(a) Use this method to list the treatment combinations obtained in Exercise 
12 in modified standard order. 

(b) Generalize the above rule for arranging the treatment combinations in 
modified standard order so that it applies to a half-replicate of a 2” fac- 
torial experiment. (For a 1/2? fractional replicate of a 2” factorial, a 
modified standard order can be obtained by noting that any block chosen 
contains a subset of n — p letters which form a complete replicate of a 
2"-? factorial. The modified standard order is then obtained by using 
these letters only and later appending the remaining p letters to get the 
required treatment combinations.) 


The following factors are to be studied in a half-replicate of a 2° factorial 
experiment (defining contrast ABCDEF), designed to evaluate several 
chemicals as insecticides. 
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16. 
17. 


Factor Level 0 Level 1 
A. BMC 0% 5 
B. Malathion 37, 6% 
C. Tedion 1% ty A 
D, Chlordane fide A 2% 
E. Lindane 1% 4% 
F. Pyrethrum 27, 4% 


Each experimental unit consists of 10 insects, and the average lifetimes (in 
seconds) after application of the respective insecticides are as follows, in the 
random order in which they are obtained: 


ce 181 acdf 162 bd 135 abdf 131 
ae 172 1 182 df i ab 136 
abef 140 bf 171 acef 159 bcde 105 
bcdf 165 cf 176 be 179 abcdef 109 
acde 139 be 187 ac 165 af 176 
ef 186 abce 131 bcef 181 ad 150 
de 164 abcf Zo cdef 163 abde 1B Be: 
abcd 112 adef 158 bdef 128 cd 166 


(a) Write down the alias pairs. 
(b) Arrange the results in modified standard order (see Exercise 14) and use 
the method of Yates to find the effect totals. 
(c) Identify the effect totals as follows. In the last column of the Yates table, 
write down the 32 combinations [J], [A], [B], [4B], ..., [ABCDE] in 
standard order. Each of these is aliased with another one; identify the 
alias pairs by using the main effect or lower-order interaction. (For 
example, ABC DE is aliased with F—identify the effect total [ABCDE]as 
that of the main effect F.) Note that ten of the alias pairs are three- 
factor interactions aliased with three-factor interactions—label these 
“error.” 
Obtain the mean squares and complete the analysis of variance. Note 
that the error mean square, having 10 degrees of freedom, is the average 
of the mean squares of the ten effects labeled “error.” 


(d 


—_ 


Verify the expression obtained for [A] on page 412. 


Duplicating the steps indicated on page 412, show that in the given design 
the main effect for factor B is confounded with the AC interaction. [Hint: 
Express [B] and [AC]in terms of the parameters of the model.] 








Applications 
to Quality 
Assurance 





14.1 QUALITY ASSURANCE 


Although there is the tendency to think of the subject of quality assurance 
as a recent development, there is nothing new about the basic idea of 
making a quality product characterized by a high degree of uniformity. 
For centuries skilled artisans have striven to make products distinctive 
through superior quality, and once a standard of quality was achieved, to 
eliminate insofar as possible all variability between products that were 
nominally alike. 

The idea that statistics might be instrumental in assuring the quality of 
manufactured products goes back no farther than the advent of mass 
production, and the widespread use of statistical methods in problems of 
quality assurance is even more recent than that. Many problems in the 
manufacture of a product are amenable to statistical treatment, and we 
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have already studied some of them in earlier parts of this book.When we 
speak of (statistical) quality assurance, we are referring essentially to the 
three special techniques to which we shall devote the remainder of this 
chapter: quality control, acceptance sampling, and the establishment of toler- 
ance limits. Note that the word “quality,” when used technically as in this 
discussion, refers to some measurable or countable property of a product, 
such as the outside diameter of a ball bearing, the breaking strength of a 
yarn, the number of imperfections in a piece of cloth, the potency of a 
drug, and so forth. 


14.2 QUALITY CONTROL 


It may surprise some persons to learn that two apparently identical! 
parts made under carefully controlled conditions, from the same batch of 
raw material, and only seconds apart by the same machine, can never- 
theless be different in many respects. Indeed, any manufacturing process, 
however good, is characterized by a certain amount of variability which is 
of a random nature, and which cannot be completely eliminated. 

When the variability present in a production process is confined to 
chance variation, the process is said to be in a state of statistical control. 
Such a state is usually attained by finding and eliminating trouble of the 
sort causing another kind of variation, called assignable variation, which 


may be due to poorly trained operators, poor-quality raw materials, faulty 
machine settings, worn parts, and the like. Since manufacturing processes 
are rarely free from trouble of this kind, it is important to have some 





Figure 14.1. Control chart. 
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systematic method of detecting serious deviations from a state of statis- 
tical control when, or if possible before, they actually occur. It is to this 
end that control charts are principally used. 

In what follows, we shall differentiate between control charts for 
measurements and control charts for attributes, depending on whether the 
observations with which we are concerned are measurements or count 
data (say, the number of defectives in a sample of a given size). In either 
case, a control chart consists of a central line (see Figure 14.1) corre- 
sponding to the average quality at which the process is to perform, and 
lines corresponding to the upper and lower control limits. These limits are 
chosen so that values falling between them can be attributed to chance, 
while values falling beyond them are interpreted as indicating a lack of 
control. By plotting the results obtained from samples taken periodically 
at frequent intervals, it is possible to check by means of such a chart 
whether the process is under control, or whether trouble of the sort in- 
dicated above has entered the process. When a sample point falls beyond 
the control limits, one looks for trouble, but even if the points fall between 
the control limits, a trend or some other systematic pattern may serve 
notice that action should be taken to avoid serious trouble. 

The ability to “read” control charts and to determine from them just 
what corrective action should be taken is a matter of experience and highly 
developed judgment. A quality-control engineer must not only under- 
stand the statistical foundation of his subject, but he must also be thor- 
oughly acquainted with the processes themselves. The engineering and 
managerial aspects of quality control (and quality assurance in general), 
which nowadays includes incoming raw materials, outgoing products, and 
in-process control, constitute an extensive subject in themselves. In this 
chapter we shall present only the statistical aspects of the subject; more 
complete discussions of other aspects can be found in the book by D. J. 
Cowden listed in the Bibliography. 


14.3 CONTROL CHARTS FOR 
MEASUREMENTS 


When dealing with measurements, it is customary to exercise control 
over the average quality of a process as well as its variability. The first goal 
is accomplished by plotting the means of periodic samples on a so-called 
control chart for means, or more succinctly, an x chart. Variability is 
controlled by plotting the sample ranges or standard deviations, respec- 
tively, on an R chart or a chart, depending on which statistic is used to 
estimate the population standard deviation. 
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If the process mean and standard deviation, 4 and o, are known, and 
it is reasonable to treat the measurements as samples from a normal 
population, we can assert with a probability of 1 — « that the mean of a 


random sample of size will fall between u — z, aTr and w+ z, ae 


These two limits on x provide upper and lower control limits, and, under 
the given assumptions, they enable the quality-control engineer to deter- 
mine whether or not to make an adjustment in the process. 

In actual practice, uw and o are usually unknown and it is necessary to 
estimate their values from a large sample (or samples) taken while the 
process is “in control.” For this reason and because there may be no as- 
surance that the measurements can be treated as samples from a normal 
population, the confidence level 1 — @ associated with the control limits 
is only approximate, and such “probability limits” are seldom used in 
practice. Instead, it is common industrial practice to use “three-sigma 
limits” obtained by substituting 3 for z,,.. With three-sigma limits one 
usually is highly confident that the process will not be declared out of 
control when, in fact, it is actually in control. 

If there exists a long history of a process in good control, uw and o can 
be estimated from past data practically without error. Thus, the central 
line of an x chart is given by yw, and the upper and lower three-sigma control 
limits are given by uw + Ao, where A = 3/,/n and n is the size of each 
sample.* For convenience, values of A forn = 2,3,..., and 15 are given 
in Table 13. The use of a constant sample size n simplifies the mainte- 
nance and interpretation of an x chart, but as the reader will observe in 
Exercise 4 on page 432, this restriction is not absolutely necessary. 

In the more common case where the population parameters are un- 
known, it is necessary to estimate these parameters on the basis of prelimi- 
nary samples. For this purpose, it is usually desirable to obtain the results 
of 20 or 25 consecutive samples taken when the process is in control. If k 
samples are used, each of size n, we shall denote the mean of the ith sample 
by x;, and the grand mean of the k sample means by x, that is, 


Laas 
= — Rye 


t=1 





*Throughout this chapter we shall depart somewhat from the customary quality- 
control notation, in order to be consistent with the more widely accepted statistical 
notation used elsewhere in this book. (For instance, in quality control it is customary to 
denote the sample mean and standard deviation by X and oa, and the corresponding 
population parameters by x’ and a’.) 
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The process variability o can be estimated either from the standard devia- 
tions or the ranges of the k samples. Since the sample size commonly used 
in connection with control charts for measurements is small, there 1s 
usually very little loss of efficiency in estimating o from the sample ranges. 
(For an example where sample standard deviations are used in this con- 


nection, see Exercise 5 on page 432.) Denoting the range of the ith sample 
by R,, we shall thus make use of the statistic 





Since ¥ provides an unbiased estimate of the population mean y, the 
central line for the x chart is given by X. The statistic R does not provide 
an unbiased estimate of a, but multiplying R by the constant A,, we obtain 
an unbiased estimate of 3a/,/n. The constant multiplier 4,, tabulated in 
Table 13 for various values of n, depends on the assumption that the 
measurements constitute a sample from a normal population. Thus, the 
central line and the upper and lower three-sigma control limits, UCL and 
LCL, for an X chart (with yu and o estimated from past data) are given by 





central line = x 
UCL =X-+A,R 
LCL = X—A,R 





An example of this kind of control chart for the mean is shown in Figure 
14.2. 





Figure 14.2. x chart. 
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In controlling a process, it may not be enough to monitor the popu- 
lation mean. Although an increase in process variability may become 
apparent from increased fluctuations of the x’s, a more sensitive test of 
shifts in process variability is provided by a separate control chart, an R 
chart based on the sample ranges or a so-called o chart based on the 
sample standard deviations. An example of the latter may be found in 
Exercise 5 on page 432. 

The central line and control limits of an R chart are based on the 
distribution of the range of samples of size m from a normal population. 
As we observed on page 228, the mean and the standard deviation of this 
sampling distribution are given by d,o and d,o, respectively, when a is 
known. Thus, three-sigma control limits for the range are given by d,o0 + 
3d,o, and the complete set of control-chart values for an R chart (with o 
known) is given by 


central line = dao 


UCL = Dio 





Here D, = d, — 3d, and D, = d, + 3d;, and values of these constants 
can be found in Table 13 for various values of n. 

If o is unknown, it is estimated from past data as previously described, 
and the control-chart values for an R chart (with o unknown) are as fol- | 
lows: 


central line = R 


UCL = D,R 
LCL = D;R 


Here D, = D,/d, and D, = D,/d,, and values of these constants can also 
be found in Table 13 for various values of n. 


Example. To illustrate the construction of an x chart and an R chart, 
let us consider the following example. A manufacturer of a certain 
bearing knows from a preliminary record of 20 hourly samples of size 
4 that for the diameters of these bearings X = 0.9752 and R = 0.0002. 
x — 0.9750 


0000L that is, express- 


Coding his data by means of the equation 
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ing each measurement as a deviation from 0.9750 in 0.0001 inch, he 
obtains 


x chart (coded) R chart (coded) 
centralline %=2.0_ central line R = 2.0 
UCL E+ AR = 3.4 UCL DsR = 4.6 
LCL x —A,R=0.6 LCL D3;R=0 


The values of A, = 0.729, D,; = 0, and D, = 2.282 for samples of size 
4 were obtained from Table 13. Graphically, these control charts 
are shown in Figures 14.2 and 14.3, where we have also indicated the 


results subsequently obtained in the following 20 samples: 


Hour Coded sample values x R 
1 ey ye: 1.9 1.2 je 1.0 
2 0.8 13 2.1 0.9 Laz 1.3 
3 1.0 1.4 1.0 1.3 1.18 0.4 
4 0.4 —0.6 0.7 0.2 0.18 1.3 
5 1.4 Z3 2.8 2.7 2.30 1.4 
6 1.8 2.0 11 0.1 1.25 1.9 
7 1.6 1.0 LS 2.0 1.52 1.0 
8 2,3 1.6 1.8 1.2 1.78 1.3 
9 2.9 2.0 0.5 22 1.90 2.4 

10 1 La 3.1 1.6 Lda 2.0 
i 1.7 3.6 Zo 1.8 2.40 1.9 
12 4.6 2.8 3.5 1.9 3.20 Zat 
13 2.6 2.8 3.2 13 pe 1.7 
14 2.3 2A ZA 1.7 2.05 0.6 
15 Lg 1.6 1.8 1.4 1.68 0.5 
16 1.3 2.0 3.9 0.8 2.00 3.1 
17 pe. 1.5 0.6 0.2 1.28 2.6 
18 Let 3.6 0.9 I 1.92 Zul 
19 1.6 0.6 1.0 0.8 1.00 1.0 
20 La 1.0 0.5 22 1.35 ee 


Inspection of Figure 14.2 shows that only one of the points falls outside 
of the control limits, but it also shows that there may nevertheless have 
been a downward shift in the process average. Figure 14.3 shows a 
definite downward shift in the process variability; note especially that 
most of the sample ranges fall below the central line of the R chart. 


The reader may have observed the close connection between the use of 
control charts and the testing of hypotheses. A point on an x chart that is 
out of control corresponds to a sample for which the null hypothesis that 
LL = [Up is rejected. To be more precise, we should say that control-chart 
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Figure 14.3. R chart. 


techniques provide sequential, temporally ordered sets of tests. We are 
interested not only in the positions of individual points, but also in 
possible trends or other patterns exhibited by the points representing 
successive samples. We might, thus, use the sign test (see page 272) to 
check whether there has been a shift in the product average in the above 
example, or a run test (see page 284) to determine whether there is a 
significant trend. 


14.4 CONTROL CHARTS FOR ATTRIBUTES 


Although more complete information can usually be gained from 
measurements made on a finished product, it is often quicker and cheaper 
to check the product against specifications on an “attribute” or “go, no- 
go” basis. For example, in checking the diameter and eccentricity of a ball 
bearing it is far simpler to determine whether it will pass through circular 
holes cut in a template than to make several measurements of the diameter 
with a micrometer. In this section we shall discuss two fundamental kinds 
of control charts used in connection with attribute sampling, the fraction- 
defective chart, also called a “p chart,” and the number-of-defects chart, 
also called a “c chart.” To clarify the distinction between “number defec- 
tive” and “number of defects,” note that a unit tested can have several 
defects, while, on the other hand, it is either defective or it is not. In many 
applications, a unit is referred to as defective if it has at least one defect. 

Control limits for a fraction-defective chart are based on the sampling 
theory for proportions introduced in Section 9.1, namely, on the normal 
curve approximation to the binomial distribution. Thus, if a standard is 
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given, that is, if the fraction defective should take on some preassigned 
value p, the central line is p and three-sigma control limits for the fraction 


defective in random samples of size 1 are given by p + 3 F p(l — p) If no 
n 


standard is given, which is more frequently the case in actual practice, p 
will have to be estimated from past data. If k samples are available, d; is the 
number of defectives in the ith sample, and n; is the number of observa- 
tions in the ith sample, it is customary to estimate p as the proportion of 
defectives in the combined sample, namely, as 


Substituting p for p in the above formulas, we obtain the following 
control-chart values for a fraction-defective chart based on the analysis of 
past data: 


central line = p 


SN 


UCL =p +3,/—P 


LCL =p — 3,2 =D 
n 


Note that if p is small, as is often the case in practice, substitution in 
the formula for the lower control limit might yield a negative number. 
When this occurs, it is customary to regard the lower control limit as if it 
were zero and, in effect, to use only the upper control limit. Another 
complication that can arise if p is small is that the binomial distribution 
may not be adequately approximated by the normal distribution. Gener- 
ally speaking, the use of the above control limits for p charts is unrealistic 
whenever 7 and p are such that the underlying binomial (or hypergeome- 
tric) distribution cannot be approximated by a normal curve (see page 
108). In such cases it is best to use an upper control limit obtained directly 
from a table of binomial probabilities or, perhaps, use the Poisson approx- 
imation to the binomial distribution. 

Equivalent to the p chart for the fraction defective is the control chart 
for the number of defectives. Instead of plotting the fraction defective in a 
sample of size n, one plots the number of defectives, and the control-chart 
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values for this kind of chart are obtained by multiplying the above values 
for the central line and the control limits by n. Thus, if p is estimated by p, 
the control-chart values for a number-of-defectives chart are as follows: 


central line = np 
UCL = np + 3,/np(1 — Pp) 
LCL = np — 3,/np(1 — PD) 


Example. As an illustration of a p chart, suppose it is desired to control 
the output of a certain integrated circuit production line to maintain a 
“vield” of 60 percent, that is, a proportion defective of 40 percent. To 
this end, daily samples of 100 units are checked to electrical specifi- 
cations, with the following results: 


Number of Number of Number of 
Date defectives Date defectives Date defectives 
3-12 24 3-26 44 4- 9 23 
3-13 38 3-27 52 4-10 31 
3-16 62 3-30 45 4-13 26 
3-17 34 3-31 30 4-14 a2 
3-18 26 4- 1 34 4-15 35 
3-19 36 4- 2 33 4-16 15 
3-20 38 4- 3 pe 4-17 24 
3-23 52 4- 6 34 4-20 38 
3-24 33 4- 7 43 4-21 21 
3-25 44 4- 8 28 4-22 16 


Since the standard is given as p = 0.40, the control-chart values are 
central line = 0.40 


UCL = 0.40 + 5 COD) = 055 
100 
LCL = 0.40 — 3,/ SS — 0.25 


The corresponding control chart with points for the 30 sample fractions 
defective is shown in Figure 14.4, and it exhibits some interesting 
characteristics. Note that there is only one point out of control on the 
high side, but there are seven points out of control on the low side. 
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Figure 14.4. p-chart. 


Most of these seven “low points” occurred after April 1, and there 
appears to be a general downward trend. In fact, there is an unbroken 
run of eleven points below the central line after April 7. It would appear 
from this chart that the yield is not yet stabilized, and that the process 
is potentially capable of maintaining a yield well above the nominal 
60 percent value. 


There are situations where it is necessary to control the number of 
defects in a unit of product, rather than the fraction defective. For example, 
in the production of carpeting it is important to control the number of 
defects per hundred yards; in the production of newsprint one may wish 
to control the number of defects per roll. These situations are similar to 
the one described in Section 3.7, which led to the Poisson distribution. 
Thus, if c is the number of defects per manufactured unit, c is taken to be 
a value of a random variable having the Poisson distribution. 

It follows that the center line for a number-of-defects chart is the param- 
eter 1 of the corresponding Poisson distribution, and that three-sigma 
control limits can be based on the fact that the standard deviation of this 
distribution is ,/ A. If 4 is unknown, that is, if no standard is given, its 
value is usually estimated from at least 20 values of c observed from past 
data. If k is the number of units of product available for estimating A, and 
if c, is the number of defects in the ith unit, then J is estimated by 
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and the control-chart values for the c chart are 





central line = ¢ 
UCL =é4+3,/é 
LCL =¢ — 3,/¢ 





Example. To illustrate this kind of control chart, suppose that it is 
known from past experience that on the average an aircraft assembly 
made by a certain company has ¢ = 4 missing rivets. The correspond- 
ing control chart for the number of missing rivets is shown in Figure 
14.5, on which we have also plotted the results of inspection which 
revealed 4, 6,5, 1, 2, 3,5, 7, 1, 2, 2, 4, 6, 5, 3, 2, 4, 1, 8, 4, 5, 6, 3, 4, 
and 2 missing rivets in 25 assemblies. 





Figure 14.5. c-chart. 


EXERCISES 


1. A plastics manufacturer extrudes blanks for use in the manufacture of 
eyeglass temples. Specifications require that the thickness of these blanks 
have 4 = 0.150 inch and a = 0.002 inch. 

(a) Use the specifications to calculate a central line and three-sigma control 
limits for an x chart with n = 5. 

(b) Use the specifications to calculate a central line and three-sigma control 
limits for an R chart with n = 5. 
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(c) Plot the following means and ranges, obtained in 20 successive random 
samples of size 5, on charts based on the control-chart constants obtained 
in parts (a) and (b), and discuss the process. 


Sample x R Sample x R 
1 0.152 0.004 11 0.149 0.003 
2 0.147 0.006 12 0.153 0.004. 
3 0.153 0.004 13 0.150 0.005 
4 0.153 0.002 14 0.152 0.001 
5 0.151 0.003 15 0.149 0.003 
6 0.148 0.002 16 0.146 0.002 
7 0.149 0.006 Ly 0,154 0.004 
8 0.144 0.001 18 0.152 0.005 
9 0.149 0.003 19 0.151 0.002 
10 0.152 0.005 20 0.149 0.004 


2. Calculate ¥ and R for the data of part (c) of Exercise 1, and use these values 
to construct the central lines and three-sigma control limits for new x and 
R charts to be used in the control of the thickness of the extruded plastic 
blanks. 


3. The following data give the means and ranges of 25 samples, each consisting 
of four compression test results on steel forgings, in thousands of pounds 
per square inch: 





Sample 1 2 3 4 5 6 7 8 
x 45.4 48.1 46.2 45.7 41.9 49.4 52.6 54.5 
R Zul 3.1 5.0 1.6 Zid a7 6.5 3.6 

Sample 9 10 I] 12 13 14 IS 16 
x 45.1 47.6 42.8 41.4 43.7 49.2 51.1 42.8 
R pe) 1.0 3.9 5.6 21 3.1 1.5 2.2 


Sample 17 18 19 20 Zi 22 4) 24 25 


Bs S11 52.4 47.9 486 53.3 49.7 48.2 51.6 52.3 
R 1.4 4.3 Lue ea | 3.0 1.1 eae | 1.6 2.4 


(a) Use these data to find the central line and control limits for an x chart. 

(b) Use these data to find the central line and control limits for an R chart. 

(c) Plot the given data on x and R charts based on the control-chart con- 
stants computed in parts (a) and (b), and interpret the results. 

(d) Using runs above and below the central line (similar to runs above and 
below the median discussed on page 284), test at a level of significance of 
0.05 whether there is a trend in the x-values. 
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(e) Would it be reasonable to use the control limits found in this exercise in 
connection with subsequent compression test measurements from the 
same process? Why? 


4. Reverse-current readings (in nanoamperes) are made on a sample of ten 
transistors every half hour. Since some of the units may prove to be “shorts” 
or “opens,” it is not always possible to obtain ten readings. The following 
table shows the number of readings made at the end of each half-hour inter- 
val during an eight-hour shift, and the mean reverse currents obtained. 








Sample 
n 10 6 9 8 8 10 7 9 
x iZ.2 11.1 10.2 11.6 Ze 12.3 9.7 15.6 
Sample 9 10 Il 12 13 14 15 16 
n ‘i 8 10 9 7 8 9 10 
x 16.7 9.8 11.6 Ti2 10.1 9.5 13.1 14.2 


(a) Find the central line for an x chart by taking the weighted mean of the 
sixteen x’s, weighting each value with the size of the corresponding 
sample. 

(b) Construct a table showing the central line found in part (a) and three- 
sigma control limits corresponding to n = 6, 7, 8, 9, and 10. Use R = 
4.0, a value based on prior data. 

(c) Plot the data on a control chart like the one of Figure 14.6 and interpret 
the results. 





Figure 14.6. Exercise 4. 


5, If the sample standard deviations instead of the sample ranges are used to 
estimate a, the control limits for the resulting * chart are given by x + A;S, 
where 5 is the mean of the sample standard deviations obtained from given 
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10. 


data, and A; can be found in Table 13. Note that in connection with prob- 

lems of quality control the sample standard deviation is defined using the 

divisor n instead of n — 1. The corresponding R chart is replaced by a a 

chart, having the central line c, and the upper and lower control limits B35 

and B,s, where B; and B, can be obtained from Table 13. 

(a) Construct an x chart and a o chart for 20 samples of size 3 which had 
x equal to 21.2, 19.4, 20.4, 20.4, 20.4, 19.0, 20.3, 21.1, 21.6, 22.1, 24.4, 
23.9, 24.9, 24.1, 21.8, 19.5, 20.3, 22.5, 23.4, 23.3, and s equal to 2.0, 0.8, 
1.1, 0.9, 1.0, 0.3, 1.3, 2.0, 0.8, 1.0, 1.5, 1.0, 1.5, 0.8, 1.3, 2.9, 4.3, 1.2, 0.3, 
eB he 

(b) Would it be reasonable to use these control limits for subsequent data? 
Why? 


. In order to establish control charts for a boring process, thirty samples of 5 


measurements of the inside diameters are taken, and the results areX = 1.317 
inches and 5 = 0.002 inch. Using the method of Exercise 5, construct an x 
chart for n = 5, and on it plot the following means obtained in 25 successive 
samples: 1.328, 1.330, 1.321, 1.325, 1.332, 1.340, 1.327, 1.321, 1.324, 1.325, 
1,329, 1.326, 1.330, 1.324, 1.328, 1,322, 1.326, 1.327, 1.329, 1,325, 1.324, 
1.329, 1.330, 1.321, and 1.329. Discuss the results. 


- Suppose that in the example of Exercise 6 it is desired to establish control 


also over the variability of the process. Using the method of Exercise 5 and 
the values of X and § given in Exercise 6, calculate the central line and con- 
trol limits for a o chart with n = 5. 


. Thirty-five successive samples of 100 castings each, taken from a production 


line, contained, respectively, 3, 3, 5, 3, 5, 0, 3, 2, 3, 5, 6, 5, 9, 1, 2, 4, 5, 2, 0, 
10, 3, 6, 3, 2, 5, 6, 3, 3, 2, 5, 1, 0, 7, 4, and 3 defectives. If the fraction defec- 
tive is to be maintained at 0.02, construct a p chart for these data and state 
whether or not this standard is being met. 


. The data of Exercise 8 may be looked upon as evidence that the standard of 


2 percent defectives is being exceeded. 

(a) Use the data of Exercise 8 to construct new control limits for the fraction 
defective. 

(b) Using the control limits found in part (a), continue the control of the 
process by plotting the following data on the number of defectives 
obtained in 20 subsequent samples of size n = 100: 2, 4, 2, 4, 7, 5, 3, 2, 
2, 3, 5, 6, 4, 5, 8, 0, 5, 5, 4, and 2. 


The specifications for a certain mass-produced valve prescribe a testing pro- 
cedure according to which each valve can be classified as satisfactory or 
unsatisfactory (defective). Past experience has shown that the process can 
perform so that p = 0.03. Construct a three-sigma control chart for the 
number of defectives obtained in samples of size 100, and on it plot the fol- 
lowing numbers of defectives obtained in such samples randomly selected 
from 30 successive half-day’s production: 3, 4, 2, 1, 5, 2, 1, 2, 3, 1, 3, 2, 2, 
2,1, 1, 2, 0, 4, 3, 1, 0, 2, 4, 0, 1, 5, 7, 3, and 2. 
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11. The standard for a process producing tin plate in a continuous strip is 5 
defects in the form of pinholes or visual blemishes per hundred feet. Based 
on the following set of 25 observations, giving the number of defects per 
hundred feet, can it be concluded that the process is in control to this stan- 
dard? 


Inspection number i &® 2 4 5 @©-7F 8&8 2 100 TL 








Number of defects 3 2 2 4 4 4 6 4 =!1 i 3 5 


Inspection number | 13 14 15 16 17 18 19 20 21 22 23 24 25 





Number of defects 4 6 6 9 5 2 6 5 11 6 6 8 2 


12. A process for the manufacture of 4 feet by 8 feet woodgrained panels has 
performed in the past with an average of 2.7 imperfections per 100 panels. 
Construct a chart to be used in the inspection of the panels and discuss the 
control if 25 successive 100-panel lots contained, respectively, 4, 1, 0, 3, 5, 3, 
5A. 1, 4,0, 1,4, 2; 3, 7,4, 2,1, 3; 0,2, 8,4, 0ne 3 imperfections. 


14.5 TOLERANCE LIMITS 


Inherent in every phase of industrial quality control is the problem of 
comparing some quality characteristic or measurement of a finished 
product against given specifications. Sometimes the specifications, or 
tolerance limits, are so stated by the customer or by the design engineer 
that any appreciable departure will make the product unusable. There 
remains, however, the problem of producing the part so that an acceptably 
high proportion of units will fall within tolerance limits specified for the 
given quality characteristic. Also, if a product is made without prior 
specifications, or if modifications are made, it is desirable to know within 
what limits the process can hold a quality characteristic a reasonably high 
percentage of the time. We, thus, speak of “natural” tolerance limits; that 
is, we let the process establish its own limits which, according to experi- 
ence, can be met in actual practice. 

If reliable information is available about the distribution underlying 
the measurement in question, it is a relatively simple matter to find natural 
tolerance limits. For instance, if long experience with a product enables us 
to assume that a certain dimension is normally distributed with the mean 4 
and the standard deviation a, it is easy to construct limits between which 
we can expect to find any given proportion P of the population. For P = 
0.90 we have the tolerance limits wz + 1.6450, and for P = 0.95 we have 
uu + 1.96a, as can easily be verified from a table of normal curve areas. 
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In most practical situations the true values of uw and o are not known, 
and tolerance limits must be based on the mean x and the standard devia- 
tion s of a random sample. Whereas yw + 1.960 are limits including 95 
percent of a normal population, the same cannot be said for the limits 
x + 1.96s. These limits are values of random variables and they may or 
may not include a given proportion of the population. Nevertheless, it is 
possible to determine a constant K so that one can assert with a degree of 
confidence 1 — a that the proportion of the population contained between 
x — Ks and x + Ks is at least P. Such values of K for random samples 
from normal populations are given in Table 14 for P = 0.90, 0.95, and 
0.99, degrees of confidence of 0.95 and 0.99, and selected values of n from 
2 to 1000. 


Example. To illustrate this technique, suppose that a manufacturer 
takes a sample of size n = 100 from a very large lot of mass-produced 
compression springs and that he obtains x = 1.507 and s = 0.004 
inch for the free lengths of the springs. Choosing a level of confidence 
of 0.99 and a minimum proportion of P = 0.95, he obtains the tole- 
rance limits 1.507 + (2.355)(0.004); in other words, the manufacturer 
can assert with a degree of confidence of 0.99 that at least 95 percent of 
the springs in the entire lot have free lengths from 1.497 to 1.517 inches. 
Note that in problems like these the minimum proportion P as well as 
the degree of confidence 1 — « must be specified; also note that the 
lower tolerance limit is rounded down and the upper tolerance limit is 
rounded up. 


To avoid confusion, let us also point out that there is an essential 
difference between confidence limits and tolerance limits. Whereas con- 
fidence limits are used to estimate a parameter of a population, tolerance 
limits are used to indicate between what limits one can find a certain pro- 
portion of a population. This distinction is emphasized by the fact that 
when n becomes large the length of a confidence interval approaches zero, 
while the tolerance limits will approach the corresponding values for the 
population. Thus, for large n, K approaches 1.96 in the columns for P = 
0.95 in Table 14. 


EXERCISES 


1. To check the strength of carbon steel for use in chain links, the yield stress of 
a random sample of 25 pieces was measured, yielding a mean and a standard 
deviation of 52,800 psi and 4,600 psi, respectively. Establish tolerance limits 
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with & — 0.05 and P = 0.99, and express in words what these tolerance limits 
mean. 


. Ina study designed to determine the number of turns required for an artillery- 


shell fuse to arm, 75 fuses, rotated on a turntable, averaged 38.7 turns with a 
standard deviation of 4.3 turns. Establish tolerance limits for which one can 
assert with a degree of confidence 0.99 that at least 95 percent of the fuses will 
arm within these limits. 


In a random sample of 40 piston rings chosen from a production line, the 

mean edge width was 0.1063 inch, and the standard deviation was 0.0004 inch. 

(a) Between what limits can it be said with 95 percent confidence that at least 
90 percent of the edge widths of piston rings produced by this production 
line will lie? 

(b) Find 95 percent confidence limits for the true mean edge width, and 
explain the difference between these limits and the tolerance limits found 
in part (a). 


. Nonparametric tolerance limits can be based on the extreme values in a random 


sample of size n taken from any continuous population. The following equa- 
tion relates the quantities n, P, and &, where P is the minimum proportion of 
the population contained between the smallest and largest observations with 
confidence 1 — &: 


nP™-! —(n —1)P" =6 
An approximate solution for 7 is given by 


Lg ee de 
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where ¥2 is the value of chi square for 4 degrees of freedom that corresponds 

to a right-hand tail of area &. 

(a) How large a sample is required to be 95 percent certain that at least 90 
percent of the population will be included between the extreme values of 
the sample? 

(b) Atleast what proportion of the population can be expected to be included 
between the extreme values of a sample of size 100 with 95 percent confi- 
dence? 


14.6 ACCEPTANCE SAMPLING 


Manufactured goods are shipped to the purchaser in lots ranging in 


size from only a few to many thousands of individual items. Ideally, each 
lot should not contain any defectives, but practically speaking it is rarely 
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possible to meet this goal. Recognizing the fact that some defective goods 
are bound to be delivered, even if each lot were to be inspected 100 per- 
cent, most consumers require that evidence, based on careful inspection, 
be given that the proportion of defectives in each lot is not excessive. 

A frequently used and highly effective method for providing such 
evidence is that of sampling inspection, where items are selected from each 
lot prior to shipment (or prior to acceptance by the consumer), and a 
decision is made on the basis of this sample whether to accept or reject 
the lot. Acceptance of a lot ordinarily implies that it can be shipped (or 
be accepted by the consumer), even though it may contain some defective 
items. Arrangements between the producer and the consumer may allow 
for some form of credit to be given for defectives subsequently discovered 
by the consumer. Rejection of a lot need not mean that it is to be scrapped; 
a rejected lot may be subjected to closer inspection with the aim of elimi- 
nating all defective items. 

Since the cost of inspection is rarely negligible (sometimes it is nearly 
as high as or higher than the cost of production), it is seldom desirable to 
inspect each item in a lot. Thus, acceptance inspection usually involves 
sampling; more specifically, a random sample is selected from each lot and 
the lot is accepted if the number of defectives found in the sample does not 
exceed a given acceptance number. This procedure is equivalent to a test of 
the null hypothesis that the proportion defective p in the lot equals some 
specified value p, against the alternative that it equals p,, where Pe = De 
In acceptance sampling the value p, is called the acceptable quality level or 
AQL, and p, is called the Jot tolerance percent defective or LTPD. The 
probability of a Type I error, «, can be interpreted as an upper limit to the 
proportion of “good” lots (lots with p < p,) that will be rejected, and in 
this context it is called the producer’s risk. The probability of a Type II 
error, 8, gives an upper bound to the proportion of “bad” lots (lots with 
P = P;) that will be accepted, and it is called the consumer’s risk. 

A single-sampling plan is simply a specification of the sample size and 
the acceptance number to be used, and its choice is usually based on a 
specified AQL and (or) LTPD in association with given producer’s and 
(or) consumer’s risks. A given sampling plan is best described by its operat- 
ing-characteristic or OC curve, which gives the probability of acceptance 
for each value that can be assumed by the lot proportion defective p. 
Thus, the OC curve describes the degree of protection offered by the sam- 
pling plan against incoming lots of various qualities. If a sample of size n 
is taken from a lot containing N units, and if the acceptance number is c, 
the probability of accepting a lot containing the proportion of defectives p 
(the lot contains Np defectives) can be calculated by using the hyper- 
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geometric distribution, as follows: 





L(p) = & hs n, Np. N) 





Example. If the lot size is N = 100, the sample size is n = 10, and the 
acceptance number is c = 1, we have 


end Guha P)) i ew oa P)) 
(0) 


Since calculations involving the hypergeometric distribution are 
fairly tedious, especially when n and WN are large, it is customary in 
acceptance sampling to approximate the hypergeometric distribution 
with the binomial distribution, as on page 62. For instance, for p = 
0.10 the exact value is (0.10) = 0.739, as the reader will be asked to 
verify in Exercise 5 on page 447, while Table 1 with n = 10 and p = 
0.10 yields the binomial approximation 


L(p) = 


L(0.10) = B(1; 10, 0.10) = 0.736 


A sketch of the OC curve for the sampling plan n = 10, c = 1 can be 
made rapidly with the aid of Table 1, and the result is shown in Figure 
14.7. From this curve it can be seen that the producer’s risk is ap- 
proximately 0.05 when the AQL is 0.04, and the consumer’s risk is 
approximately 0.10 when the LTPD is 0.34. 


A sampling plan can also be described by means of its average outgoing 
quality or AOQ curve. This curve describes the degree of protection offered 
by the sampling plan by showing the average quality of outgoing lots 
corresponding to each quality level of incoming lots (that is, lots prior to 
inspection). If incoming lots are of good quality, that is, if their proportion 
defective is smaller than the AOL, very few lots will be rejected and the 
average outgoing quality or AOQ will be good. If the incoming lots are of 
poor quality, that is, if their proportion defective is larger than the L7PD, 
most of them will be rejected. If all rejected lots are inspected 100 percent 
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Figure 14.7. OC curve. 


and all defective units are replaced by good units prior to acceptance of 
the lot, then the average outgoing quality will be good even though the 
average incoming quality is poor. It is when the average incoming quality 
lies between the AQL and the LTPD that the poorest quality of lots will be 
shipped. In general, there will be a maximum AOQ over all values of 
incoming quality p, and this value is called the average outgoing quality 
limit, or AOQL. 

It is not difficult to derive a formula for finding the AOQ corresponding 
to a given incoming quality p under the assumption that all defectives in 
rejected lots are replaced by acceptable items prior to their final accept- 
ance. If the incoming quality is p, the probability that a lot will be accepted 
is L(p), and each such lot contains the proportion p of defectives. The 
proportion 1 — L(p) of lots which are eventually rejected contain no 
defectives, and it follows that the AOQ is given by p-L(p) + 0-[1 — L(p)], 
or 
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AOQ = p:L(p) 


| Jr nn aa att EE aan 


More common practice is to remove or replace defectives found in ac- 
cepted as well as rejected lots, but the modification thus required in the 
AOQ is usually minor and it is customary to use the above formula regard- 
less of the inspection procedure. 


Continuation of Example. The 4OQ curve for the sampling plan n = 
10, c = 1 is shown in Figure 14.8, and it is apparent from this figure 
that the AOQL is approximately 0.081. 





Figure 14.8. AOQ curve. 


Sometimes, smaller samples (and hence, reductions in cost) can be 
achieved without sacrifice in the degree of protection by the use of so- 
called double or multiple sampling. A double sampling plan involves the 
selection of a random sample of size n, from a lot; if the sample contains 
c, or fewer defectives, the lot is accepted; if it contains c, or more defec- 
tives (c’, > c,) the lot is rejected; otherwise, a second sample of size n, 
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is taken from the lot, and the lot is accepted unless the total number of 
defectives in the combined sample of size n, + n, exceeds c,. A multiple 
sampling plan is similar in nature to a double sampling plan, but it involves 
more than two stages. 


Example. An example of a multiple sampling plan is shown in the 
following table: 








Combined samples 
Sample Acceptance Rejection 
Sample size Size number number 
First 20 20 3 
Second 20 40 1 4 
Third 20 60 3 5 
Fourth 20 80 3 6 
Fifth 20 100 5 7 
Sixth 20 120 6 8 
Seventh 20 140 7 8 





In the first step, the lot is rejected if there are 3 or more defectives, 
otherwise sampling continues. In the second step, the lot is accepted if 
the combined sample contains at most | defective, it is rejected if there 
are 4 or more defectives, otherwise sampling continues. This goes on, 
if necessary, until in the final step the lot is accepted if there are at 
most 7 defectives in the combined sample of size 140, and otherwise it 
is rejected. 


By an appropriate choice of the sample sizes and the acceptance and 
rejection numbers it is possible to match the OC curve of a double or 
multiple sampling plan closely to that of an equivalent single sampling 
plan. Thus, the degree of protection offered by a double or multiple sam- 
pling plan can be made essentially the same as that offered by an equiva- 
lent single sampling plan. 

The advantage of double or multiple sampling is that there is a high 
probability that a very good lot will be accepted or a very poor lot will be 
rejected on the basis of the first sample (or, at least, an early sample), thus 
reducing the required amount of inspection. On the other hand, if the lot 
quality is “intermediate,” the total sample size required may actually be 
larger than that of the equivalent single sampling plan. 
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Example. Consider the following double sampling plan: 







Combined samples 








Sample Acceptance Rejection 
Sample size Size number number 
First 15 15 1 5 
Second 30 45 5 6 





If the incoming lot quality is p = 0.05, the probability that a second 
sample will be required is the same as the probability that there are 2 
3, or 4 defectives in a sample of size 15. According to Table 1, this 
probability is equal to 


B(4; 15, 0.05) — B(1; 15, 0.05) = 0.170 


Thus, on the average, it will require a sample of size 15 + (0.170)(30) 
— 20.1 to decide whether to accept or reject an incoming lot of quality 
p = 0.05. Similar calculations enable us to find the average sample size 
required to inspect a lot having any given incoming quality p. A graph 
showing the relation between the average sample size (also called the 
average sample number) and the incoming lot quality is called an ASN 
curve; the ASN curve for the double sampling plan described above is 
shown in Figure 14.9. 


Several standard sampling plans have been published to facilitate the 
use of acceptance sampling (see the book by I. W. Burr listed in the Bibli- 
ography). Among the most widely used standard plans are those contained 
in the Military Standard 105 D Tables (see Bibliography). These plans stress 
the maintenance of a specified AQL, and they are designed to encourage 
the producer to offer only good products to the consumer. To accomplish 
this there are three general levels of inspection corresponding to different 
consumer’s risks. (Inspection level II is normally chosen; level I uses 
smaller sample sizes and level III uses larger sample sizes than level II.) 
There are also three types of inspection: normal, tightened, and reduced. 
The type of inspection depends on whether the average proportion defec- 
tives for prior samples has been above or below the AQL, and it may be 
changed during the course of inspection. Under tightened inspection the 
producer’s risk is increased and the consumer’s risk is (slightly) decreased; 
under reduced inspection the consumer’s risk is increased and the pro- 
ducer’s risk is (slightly) decreased. Tables are available for single, double, 
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Figure 14.9. ASN curve. 


and multiple sampling, and a brief portion of these tables is included in 
this book in Tables 15 and 16. 

The procedure in using MIL-STD-105D for single sampling is first to 
find the sample size code letter corresponding to the lot size and the 
desired inspection level and type. Then, using the sample size code letter 
thus obtained and the appropriate AQL, one finds the sample size and the 
acceptance number from the master table. A portion of the table for 
finding sample size code letters is given in Table 15, and a portion of the 
master table for normal inspection is included in Table 16. 


Example. To illustrate the use of MIL-STD-105D, suppose that 
incoming lots contain 2,000 items, and inspection level II is to be used 
in conjunction with normal inspection and an AQL of 0.025, or 2.5 
percent. From Table 15 we find that the sample size code letter is K. 
Then, entering Table 16 in the row labeled K, we find that the sample 
size to be used is 125. Using the column labeled 2.5, we find that the 
acceptance number is 7 and the rejection number is 8. Thus, if a single 
sample of size 125, selected at random from a lot of 2,000 items, con- 
tains 7 or fewer defectives the lot is to be accepted; if the sample 
contains 8 or more defectives the lot is to be rejected. 
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The concept of multiple sampling is carried to its extreme in so-called 
sequential sampling. A sampling procedure is said to be sequential if, after 
each observation, one of the following decisions is made: accept whatever 
hypothesis is being tested, reject the hypothesis, or take another observa- 
tion. Although sequential procedures are used also in connection with 
other kinds of problems, we shall discuss this kind of sampling only in 
connection with acceptance sampling, where we shall thus decide after 
the inspection of each successive item whether to accept a lot, reject it, or 
continue sampling. 

The construction of a sequential sampling plan consists of finding two 
sequences of numbers a, and r,, where n is the number of observations, 
so that the lot is accepted as soon as the number of defectives is less than 
or equal to a, for some n, the lot is rejected as soon as the number of defec- 
tives is greater than or equal to rv, for some n, and sampling continues so 
long as the number of defectives in a sample of size n falls between a, and 
r,. If an acceptance plan is to have py and p, as its AQL and LTPD, the 
producer’s risk «, and the consumer’s risk f, it can be shown (see the book 
by A. Wald in the Bibliography) that the required values of a, and r, can 
be computed by means of the formulas 








log B jt weep 





Pe — ae 
log See) . 
1 — 
log! = B + lg =P 
r= i 
log=! — lo A 
eT, T= py 





If a, is not an integer, it is replaced by the largest integer less than a,; ifr, 
is not an integer, it is replaced by the smallest integer greater than r,,. 


Example. To illustrate this procedure, let p) = 0.05, p, = 0.20, @ = 
0.05, and B = 0.10. Substituting these values into the above formulas 
for a, and r,, we obtain 


a, = —1.45 + 0.11n 
yr, = 1.86-+ O.1ln 


and, letting nm = 1, 2,3,..., and 25, we get the acceptance and rejec- 
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tion numbers shown in the second and fourth columns of the following 
table: 


Number of Acceptance Number of Rejection 
items inspected number defectives number 
Nl ayn dy r n 
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In the third column we have indicated the results obtained in an inspec- 
tion where the 8th, 17th, 19th, 22nd, and 23rd items are defective, and 
where the inspection terminates with rejection of the lot after inspection 
of the 23rd item. 

The tabular procedure illustrated can be replaced by an equivalent 
graphical procedure for carrying out sequential sampling inspection. 
Plotting the values of a, and r, obtained from the equations on page 
444 (without rounding), we obtain two straight lines like those of Figure 
14.10. Sampling terminates with rejection or acceptance if the number 
of defectives observed falls above the line for r,, or below the line for 
a,, respectively. 


The main advantage of sequential sampling is that it can materially 
reduce the required amount of inspection. Studies have shown that the 
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Figure 14.10. Graphical procedure for sequential sampling. 


average decrease in sample size is often near 50 percent when compared 
with the sample size of equivalent single sampling plans. The major 
disadvantage is that, in a sequential sampling plan, there is no upper limit 
to the number of items that might have to be inspected in order to reach 
a decision concerning the acceptance or rejection of a lot. In fact, the 
sample size is a random variable and its value will occasionally be very 


large. For this reason it is customary to truncate sequential sampling pro- 
cedures by selecting a number N such that a decision must be made to 


accept or reject a lot prior to or atn = N, 


EXERCISES 


1. A sampling plan, calling for a sample of size n = 50, has the acceptance 
number c = 3. Assuming that the lot size is very large, calculate the prob- 
ability of accepting a lot of incoming quality 15 percent defective and the 
probability of rejecting a lot of incoming quality 4 percent defective: 

(a) by calculating the corresponding binomial probabilities; | 
(b) by using the Poisson approximation to the binomial distribution. 


2. A single sampling plan calls for a sample of size 150. 
(a) Use the normal approximation to find the acceptance number c if the 
AQL is to be 2.5 percent and the producer’s risk is to be & = 0.05. 
(b) Using the acceptance number obtained in part (a), determine the con- 
sumer’s risk if the LTPD is to be 6 percent. 
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3. 


10. 


11. 


12. 


13. 


Referring to the sampling plan of Exercise 1, use the Poisson approximation 
to calculate L(p) for p = 0.01, 0.02, ..., 0.19, and 0.20. Sketch the OC 
curve for this sampling plan and read off the consumer’s and producer’s 
risks corresponding to an AQL of 4 percent and an LTPD of 14 percent. 


. A single sampling plan has n = 100 and c = 5S. 


(a) Find the AQL if the producer’s risk is to be 0.025. 

(b) Find the LTPD if the consumer’s risk is to be 0.05. [Hint: Use the normal 
approximation and set up equations leading to quadratic equations in 
Po and p;, respectively. ] 


. With reference to the example of page 438, verify that 


(a) L(0.10) = 0.739, using the exact (hypergeometric) formula; 
(b) L(0.10) ~ 0.736, using the approximate (binomial) formula; 
(c) the OC curve is as given in Figure 14.7; 

(d) the AOQ curve is as given in Figure 14.8. 


. Calculate L(p) for selected values of p and sketch the OC curve for the single 


sampling plan nm = 120, c = 3. [Hint: Assume a large lot size, and use the 
normal approximation to the binomial distribution.] 


. Sketch the AOQ curve for the sampling plan of Exercise 6, and estimate the 


AOQL. 


. Using the values of L(p) obtained in Exercise 3, sketch the 4OQ curve for the 


given sampling plan and estimate the AOQL. 


. Referring to the double sampling plan on page 442, use Table 1 and the 


normal approximation (when necessary) to calculate the producer’s risk & 
when the AQL is py = 0.10. 


Calculate the average sample size required for selected values of p and sketch 
the ASN curve for the following double sampling plan: 


Combined samples 


Acceptance Rejection 
Sample number number 


First 0 3 
Second 3 4 





A lot of 200 items is to be inspected at an AQL of 6.5 percent. If MIL-STD- 
105D is to be used, with normal inspection at general inspection level II, 
what single sampling plan is required? 


An incoming lot of 5,000 items is to be inspected using MIL-STD-105D, with 
normal inspection at general inspection level IJ and an AQL of 2.5 percent. 
What single sampling plan should be used? 


Find formulas for the acceptance and rejection numbers for the sequential 
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sampling plan having an AQL of 0.10, an LTPD of 0.30, a producer’s risk of 
0.05, and a consumer’s risk of 0.10. If a sample should contain defectives on 
the third, fifth, seventh, and eighth trials, would this plan accept or reject the 
corresponding lot prior to the tenth trial? If so, on which trial ? 


A sequential sampling plan is to have pp = 0.01, p; = 0.10, & = 0.05, and 

B = 0.20. 

(a) Determine the acceptance and rejection numbers for n = 1, 2,..., and 
50. 

(b) Userandom numbers and the acceptance and rejection numbers obtained 
in part (a) to simulate the inspection of a very large lot containing 20 
percent defectives. 









Applications 
to Reliability 
and Life Testing 





15.1 INTRODUCTION 


The task of designing and supervising the manufacture of a product has 
been made increasingly difficult by rapid strides in the sophistication of 
modern products and the severity of the environmental conditions under 
which they must perform. No longer can an engineer be satisfied if the 
operation of a product is technically feasible, or if it can be made to 
“work” under optimum conditions. In addition to such considerations as 
cost and ease of manufacture, increasing attention must now be paid to 
size and weight, ease of maintenance, and reliability. The magnitude of 
the problem of maintainability and reliability is illustrated by surveys 
which have uncovered the fact that frequently a high percentage of space- 
age electronic equipment has been in inoperative condition. Military 
surveys have further shown that maintenance and repair expenses for 
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electronic equipment often exceed the original cost of procurement, even 
during the first year of operation. 

The problem of assuring and maintaining reliability has many facets, 
including original equipment design, control of quality during production, 
acceptance inspection, field trials, life testing, and design modifications. 
To complicate matters further, reliability competes directly or indirectly 
with a host of other engineering considerations, chiefly cost, complexity, 
size and weight, and maintainability. In spite of its complicated engineer- 
ing aspects, it is possible to give a relatively simple mathematical definition 
of reliability. To motivate this definition, we call the reader’s attention to 
the fact that a product may function satisfactorily under one set of condi- 
tions but not under other conditions, and that satisfactory performance 
for one purpose does not assure adequate performance for another pur- 
pose. For example, a vacuum tube which is perfectly satisfactory for use in 
a home radio may be entirely unsatisfactory for use in the airborne guid- 
ance system of a missile. Accordingly, we shall define the reliability of a 
product as the probability that it will function within specified limits for at 
least a specified period of time under specified environmental conditions. 
Thus, the reliability of a “standard equipment” automobile tire is close to 
unity for 10,000 miles of normal operation on a passenger car, but it is 
virtually zero for use at the Indianapolis “500.” 

Since reliability has been defined as a probability, the theoretical treat- 
ment of this subject is based essentially on the material introduced in the 
early chapters of this book. Thus, the rules of probability introduced in 
Chapter 2 can be applied directly to the calculation of the reliability of a 
complex system, if the reliabilities of the individual components are known. 
(Estimates of the reliabilities of the individual components are usually 
obtained from statistical life tests, such as those discussed in Sections 15.4 
and 15.5.) 

Many systems can be considered to be series or parallel systems, or a 
combination of both. A series system is one in which all components are so 
interrelated that the entire system will fail if any one of its components 
fails; a parallel system is one that will fail only if all of its components fail. 

Let us first discuss a system of m components connected in series, and 
let us suppose that the components are independent, namely, that the per- 
formance of any one part does not affect the reliability of the others. 
Under these conditions, the probability that the system will function is 
given by the special rule of multiplication for probabilities, and we have 


i maa II R; 
t=1 
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where R, is the reliability of the ith component and R, is the reliability of 
the series system. This simple product law of reliabilities, applicable to 
series systems of independent components, vividly demonstrates the effect 
of increased complexity on reliability. 


Example. Suppose a system consists of five independent components 
in series, each having a reliability of 0.970; then the reliability of the 
whole series system is (0.970)*° = 0.859. If the system complexity were 
increased so that it contained 10 similar components, its reliability 
would be reduced to (0.970)!° = 0.738. Looking at the effect of in- 
creasing complexity in another way, we find that each of the compo- 
nents in the ten-component system would require a reliability of 0.985, 
instead of 0.970, for the ten-component system to have a reliability 
equal to that of the original five-component system. 


One way to increase the reliability of a system is to replace certain 
components by several similar components connected in parallel. If a 
system consists of nm independent components connected in parallel, it will 
fail to function only if all nm components fail. Thus, if F; = 1 — R, is the 
“unreliability” of the ith component, we can again apply the special rule 
of multiplication for probabilities to obtain 


F,=T1F, 


where F, is the unreliability of the parallel system, and R, = 1 — F, is 
the reliability of the parallel system. Thus, for parallel systems we have 
a product law of unreliabilities analogous to the product law of reliabilities 
for series systems. Writing this law in another way, we get 


R,=1—]]Q—8) 


for the reliability of a parallel system. 


Example. The two basic formulas for the reliability of series and 
parallel systems can be used in combination to calculate the reliability 
of a system having both series and parallel parts. To illustrate such a 
calculation, consider the system diagrammed in Figure 15.1, which 
consists of eight components having the reliabilities shown in that 
figure. The parallel assembly C, D, E can be replaced by an equivalent 
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Figure 15.1. Systems reliability. 


component C’ having the reliability 1 — (1 — 0.70)? = 0.973, without 
affecting the overall reliability of the system. Similarly, the parallel 
assembly F, G can be replaced by a single component F”’ having the 
reliability 1 — (1 — 0.75)? = 0.9375. The resulting series system A, B, 
C"", F’, H, equivalent to the original system, has the reliability 


(0.95)(0.99)(0.973)(0.9375)(0.90) = 0.772. 


15.2 FAILURE-TIME DISTRIBUTIONS 


According to the definition of reliability given in the preceding section, 


the reliability of a system or a component will often depend on the length 
of time it has been in service. Thus, of fundamental importance in reliabil- 


ity studies is the distribution of the time to failure of a component under 
given environmental conditions. A useful way to characterize this distribu- 
tion is by means of its associated instantaneous failure rate, defined as 
follows: If f(t) is the probability density of the time to failure of a given 
component, that is, the probability that the component will fail between 
times ¢ and ¢ + At is given by f(t)-Az, then the probability that the com- 
ponent will fail on the interval from 0 to tis given by 


FO) = | fe) dx 


and the reliability function, expressing the probability that it survives to 
time ¢, is given by 


R(t) = 1 — F(t) 


Thus, the probability that the component will fail in the interval from t to 
t+ At is F(t + At) — F(t), and the conditional probability of failure in 
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this interval, given that the component survived to time t, is expressed by 


Fit + At) — F(t) 
R(@) 


Dividing by At, we obtain the average rate of failure in the interval from ¢ 
to t + At, given that the component survived to time f: 


PG = BO FG) 1 
At R(t) 


Taking the limit as At — 0, we get the instantaneous failure rate, or simply 
the failure rate 


Z(t) = ao 


where F’(t) is the derivative of F(t) with respect to ¢. Finally, observing 
that f(t) = F’(t) (see page 100) we get the relation 


~jIM. J 
ZO = RG) =T— FO 


which expresses the failure rate in terms of the distribution of failure times. 
Example. A failure-rate curve that is typical of many manufactured 


items is shown in Figure 15.2. The curve is conveniently divided into 
three parts. The first part is characterized by a decreasing failure rate 





Figure 15.2. Typical failure-rate curve. 
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and it represents the period during which poorly manufactured items 
are weeded out. (It is common in the electronics industry to “burn in” 
components prior to actual use in order to eliminate any early failures.) 
The second part, which is often characterized by a constant failure 
rate, is normally regarded as the period of useful life during which only 
chance failures occur. The third part is characterized by an increasing 
failure rate, and it is the period during which components fail primarily 
because they are worn out. Note that the same general failure-rate 
curve is typical of human mortality, where the first part represents 
infant mortality, and the third part corresponds to old-age mortality. 


Let us now derive an important relationship expressing the failure- 
time density in terms of the failure-rate function. Making use of the fact 
that R(t) = 1 — F(t) and, hence, that F’(t) = —R’(t), we can write 


R(t) 
RO 


_ _ dln R@)] 
dt 


Z(t) = — 


Solving this differential equation for R(t), we obtain 


R(t) = e7 Jo 2) 


and, making use of the relation f(t) = Z(¢)- R(t), we finally get 
f(t) = Z(t)-e- S24 


As illustrated in Figure 15.2, it is often assumed that the failure rate is 
constant during the period of useful life of a component. Denoting this 
constant failure rate by a, where a > 0, and substituting « for Z(t) in the 
formula for f(t), we obtain 


JG): =0-e" t>0 


Thus, we observe that the distribution of failure times is an exponential 
distribution when it can be assumed that the failure rate is constant. For 
this reason, the assumption of constant failure rates is sometimes also 
called the “exponential assumption.” Interpreting the time to failure as a 
waiting time, we can use the results of Section 4.7 to conclude that the 
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occurrence of failures is a Poisson process, if a component which fails is 
immediately replaced with a new one having the same constant failure rate 
g.. As we observed on page 119, the mean waiting time between successive 
failures is 1/0, or the reciprocal of the failure rate. Thus, the constant 1/a 
is often referred to as the mean time between failures, and it is abbreviated 
MTBF. 

There are situations in which the assumption of a constant failure rate 
is not realistic, and in many of these situations one assumes instead that 
the failure rate function increases or decreases “smoothly” with time. In 
other words, it is assumed that there are no discontinuities or turning 
points. This assumption would be consistent with the initial and last stages 
of the failure-rate curve shown in Figure 15.2. 

A useful function that is often used to approximate such failure-rate 
curves is given by 


A= opr™ t>0 


where « and f are positive constants. Note the generality of this function: 
If @ <1 the failure rate decreases with time, if B > 1 it increases with 
time, and if B — 1 the failure rate equals «. Note that the assumption of a 
constant failure rate, the exponential assumption, is thus included as a 
special case. 
If we substitute the above expression for Z(t) into the formula for f(t) 
on page 454, we obtain 


fOH=apr'e t=O 


where a and f are positive constants. This density, or distribution, is the 
Weibull distribution, introduced in Section 4.9, and we shall discuss its 
application to problems of life testing in Section 15.5. 


15.3 THE EXPONENTIAL MODEL IN 
RELIABILITY 


If we make the exponential assumption about the distribution of failure 
times, some very useful results can be derived concerning the MTBF, the 
mean time between failure, of series and parallel systems. In order to use 
the product laws of Section 15.1, we shall first have to obtain a relation 
expressing the reliability of a component in terms of its service time ¢. 
Making use of the fact that 


Cs one eee 3s Ce ee i f(t) dt 
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we obtain 


R@)=1-—- i ce dy oe 


for the reliability function of the exponential model. Thus, if a component 
has a failure rate of 0.05 per thousand hours, the probability that it will 
survive at least 10,000 hours of operation is e7‘°-°>)!° = 0.607. 

Suppose now that a system consists of nm components connected in 
series, and that these components have the respective failure rates «,, 
O.,...,and4,. The product law of reliabilities can be written as 





R,@) = II en wt — eC x): 
i=1 


and it can be seen that the reliability function of the series system also 
satisfies the exponential assumption. The failure rate of the entire series 


system is readily identified as >) «,, the sum of the failure rates of its 
i=1 


components. Since the MTBF is the reciprocal of the failure rate when 
each component which fails is replaced immediately with another having 
the identical failure rate, we obtain the formula 





expressing the MTBF y, of a series system in terms of the MTBF’s yw; of 
its components. In the special case where all m components have the same 
failure rate « and hence the same MTBF yw, the system failure rate is na, 
and the system MTBF is 1/na = u/n. 

For parallel systems the results are not quite so simple. If a system 
consists of m components in parallel, having the respective failure rates 
1, 0%5,-..,,, the system “unreliability” to time ¢ is given by 


F,(¢) = J] ( — e**) 


Thus, the failure-time distribution of a parallel system is not exponential 
even when each of its components satisfies the exponential assumption. 
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The system failure-rate function can be obtained by means of the formula 
Z(t) = F,(t)/R,(t), but the result is fairly complicated. Note, however, 
that the system failure rate is not constant, but depends on ¢, the “age” 
of the system. 

The mean time to failure of a parallel system is also difficult to obtain in 
general, but in the special case where all components have the same failure 
rate «, an interesting and useful result can be obtained. In this special 
case the system reliability function becomes 


R(t) =1—(1—e™)" 
i? (Te = (oe +e. t (1) lennet 


after using the binomial theorem to expand (1 — e~*‘)". Then, making use 
of the fact that f,(¢) = — R(t), we obtain 


Fyfe) = a( 7 en — 20( 3 Jer +... + (Iter 
and the mean of the failure time distribution is given by 
Mp = | t-fy(O at 


— a( 7) [te dt — 2a( 5 | [tetra +... 
0 0 
+ (—1)*"'no \, te-"™* dt 
-E(})-2G)+ tems 


It can be proved by induction that this expression is equivalent to 


(l4+5+...+—) 


1 
od oars 





Thus, if a parallel system consists of n components having the identical 
failure rate «, the mean time between failures of the system equals 
(1 + a See =) times the common MTBF of its components, pro- 


vided each defective component is replaced whenever the whole parallel 
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system fails. Thus, if we use two parallel components rather than one, the 
mean time to failure of the pair exceeds that of the single component by 50 
percent, rather than doubling it. In general, the above formula for yw, 
expresses a rather severe law of diminishing returns for parallel redun- 
dancy. 


Example. To illustrate how the formulas derived in this section can be 
used in system design, let us again consider the system diagrammed in 
Figure 15.1 on page 452. Assuming the exponential model and that the 
reliabilities are given for 10 hours of operation, we can calculate the 
failure rate of component A by solving the equation 0.95 = e=1% for 
a, and we obtain o = (5.1)10~? failures per hour, or 5.1 failures per 
thousand hours. The failure rates of all eight components (in failures 
per thousand hours) are as shown in the following table: 


Component A B G D E F G H 





Failure rate SA 10 33.7% 35.7 33.7 288 28.8 105 


To compute the mean time to failure for the entire system, we first 
obtain the mean failure times for the parallel assemblies C, D, E, and 
F, G, respectively. For C, D, E we have Ucpr = 7(! + + + =) ss 
0.051 thousand hours, or 51 hours; for F, G we have Uy = 


_ (1 + 5) = 0.052 thousand hours, or 52 hours. Although the two 


parallel assemblies do not have constant failure rates, we shall approxi- 
mate their respective failure rates by 1/0.051 = 19.6 and 1/0.052 = 19.2 
failures per thousand hours, and treat the entire system as a series sys- 
tem. Thus, the system failure rate is given approximately by 5.1 + 1.0 
+ 19.6 + 19.2 + 10.5 = 55.4 failures per thousand hours, and the 
mean time to failure of the system is approximately 1/55.4 = 0.018 or 
18 hours. 


EXERCISES 


1. An old-fashioned string of Christmas-tree lights has 8 bulbs connected in 
series. What would have to be the reliability of each bulb if there is to be a 
95 percent chance of the string’s lighting after a year’s storage? 


2. A systefh consists of 7 identical components connected in parallel. What 
must be the reliability of each component if the overall reliability of the 
system is to be 0.90? 
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3. A system consists of 6 components connected as in Figure 15.3. Find the 
overall reliability of the system, given that the reliabilities of A, B, C, D, E, 
and F are, respectively, 0.95, 0.80, 0.90, 0.99, 0.90, and 0.85. 





Figure 15.3. Exercise 3. 


4. Suppose the flight of an aircraft is regarded as a system having the three 
main components A (aircraft), B (pilot), and C (airport). Suppose, further- 
more, that component B can be regarded as a parallel subsystem consisting 
of B,; (captain), B, (first officer), and B; (flight engineer); and C is a parallel 
subsystem consisting of C; (scheduled airport) and C, (alternate airport). 
Under given flight conditions, the reliabilities of components A, B,, B2, B3, 
C,,and C, (defined as the probabilities that they can contribute to the success- 
ful completion of the scheduled flight) are, respectively, 0.9999, 0.9995, 
0.999, 0.20, 0.95, and 0.85. 

(a) What is the reliability of the system? 

(b) What is the effect on system reliability of having a flight engineer who is 
also a trained pilot, so that the reliability of B3 is increased from 0.20 to 
0.99? 

(c) If the flight crew did not have a first officer, what then would be the 
effect of increasing the reliability of B; from 0.20 to 0.99? 

(d) What is the effect of adding a second alternate landing point, C3, with 
reliability 0.80? 


5. In some reliability problems we are concerned only with initial failures, 
treating a component as if (for all practical purposes) it never failed, once it 
has survived past a certain time ¢ = &. In a problem like this, it may be 
reasonable to use the failure rate 


Z(t) = pi — =) for0<t<&@ 


0 elsewhere 


(a) Find expressions for f(t) and F(f). 
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10. 


(b) Show that the probability of an initial failure is given by 


a 


. As has been indicated in the text, one often distinguishes between initial 


failures, random failures during the useful life of the product, and wear-out 
failures. Thus, suppose that for a given product the probability of an initial 
failure (a failure prior to time ¢t = &) is 0,, the probability of a wear-out 
failure (a failure beyond time ¢ = f) is @,, and that for the interval « < fr 
< f the failure-time density is given by 


_d — 4; — @) 
i laa ere 
(a) Find an expression for F(t) for the interval a<t< f. 
(b) Show that for the interval « < t < P the failure rate is given by 


1—06, —0, 
(B — a1 — 6,)—- (1 — 8; — 2)¢ — &) 


(c) Suppose that the failure of a color television set is considered to be an 
initial failure if it occurs during the first 100 hours of usage, and a wear- 
out failure if it occurs after 15,000 hours. Assuming that the model given 
in this exercise holds and that 8, and 9, equal 0.05 and 0.75, respectively, 
sketch the graph of the failure-rate function from t = 100 to t = 15,000 
hours. 


Z(t) = 


. An integrated circuit chip has a constant failure rate of 0.02 per thousand 


hours. 

(a) What is the probability that it will operate satisfactorily for at least 
20,000 hours ? 

(b) What is the 5,000-hour reliability of a component consisting of four such 
chips connected in series? 


. Acertain component has an exponential life distribution with a failure rate 


of & = 0.0045 failures per hour. 

(a) What is the probability that the component will fail during the first 250 
hours it is in operation? 

(b) What is the probability that two such components will both survive the 
first 100 hours of operation? 


. A system containing several identical components in parallel is to have a 


failure rate of at most 5-1075 per hour. What is the least number of compo- 
nents that must be used if each has a constant failure rate of 2.0 x 1074 per 
hour? | 


A system consists of six different components connected in series. Find the 
MTBF of the system if the six components have exponential time-to-failure 
distributions with failure rates of 1.8, 2.4, 2.0, 1.3, 3.0, and 1.5 per 1,000 
hours, respectively. 
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11. A certain part has an exponential life distribution with a mean life, that is, an 
MTBF, of 1,000 hours. 
(a) What is the probability that such a part will last at least 500 hours? 
(b) What is the probability that among three such parts at least one will fail 
during the first 1,000 hours? 
(c) What is the probability that among four such parts exactly two will fail 
during the first 600 hours? 


12. If a component has the Weibull failure-time distribution with the parameters 
& = 0.005 and # = 0.80, find the probability that it will operate successfully 
for at least 5,000 hours. 


13. Throughout Sections 15.2 and 15.3 we assumed that the products with which 
we were concerned were in continuous operation. Consequently, the models 
discussed in these sections do not apply if we want to investigate the ability 
of light bulbs to withstand successive voltage overloads, the performance of 
switches which are repeatedly turned on and off, etc. In each of these cases 
failure can occur on the xth trial (x = 1, 2, 3,...), and it is often assumed 
that the probability of failure on the xth trial equals some constant p, pro- 
vided the item has not failed prior to that trial. 

(a) Show that the probability of failure on the xth trial is given by 


F(x) = p(l — p)*"! 


for x = 1, 2,3,.... This probability distribution is the geometric distri- 
bution introduced in Section 3.8. 

(b) Find F(x) for the probability distribution obtained in part (a). 

(c) What is the probability that a switch will survive 2,000 cycles of opera- 
tion, if the above model holds and the constant probability of failure as 
the result of any one of the switching cycles is p = 6 X 1074? 


15.4 THE EXPONENTIAL MODEL IN LIFE 
TESTING 


An effective and widely used method of handling problems of reliabil- 
ity is that of life testing. For the purpose of such tests, a random sample of 
n components is selected from a lot, put on test under specified environ- 
mental conditions, and the times to failure of the individual components 
are observed. If each component that fails is immediately replaced by a 
new one, the resulting life test is called a replacement test; otherwise, the 
life test is called a nonreplacement test. Whenever the mean lifetime of the 
components is so large that it is not practical, or economically feasible, to 
test each component to failure, the life test may be truncated, that is, it may 
be terminated after the first r failures have occurred (r < n), or after a 
fixed period of time has been accumulated. 
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A special method that is often used when early results are required in 
connection with very-high-reliability components is that of accelerated life 
testing. In an accelerated life test the components are put on test under 
environmental conditions that are far more severe than those normally 
encountered in practice. This causes the components to fail more quickly, 
and it can drastically reduce both the time required for the test and the 
number of components that must be placed on test. Accelerated life testing 
can be used to compare two or more types of components for the purpose 
of obtaining a rapid assessment of which is the most reliable. Sometimes, 
preliminary experimentation is carried out to determine the relationship 
between the proportion of failures that can be expected under nominal 
conditions and under various levels of accelerated environmental condi- 
tions. The methods of Sections 11.4 and 13.2 can be applied in this connec- 
tion to determine “derating curves,” relating the reliability of the com- 
ponent to the severity of the environmental conditions under which it is to 
operate. 

In the remainder of this section we shall assume that the exponential 
model holds, namely, that the failure-time distribution of each component 
is given by 


ft)=Ga-e ” t>0,a > 0 


In what follows, we shall assume that n components are put on test, life 
testing is discontinued after a fixed number, r (r < 1), of components have 
failed, and that the observed failure times are ¢; Mt, <...<t,. We 
shall be concerned with estimating and testing hypotheses about the mean 
life of the component, namely, uw = 1/a. 

Using theory developed in the article by B. Epstein mentioned in the 
Bibliography, it can be shown that unbiased estimates of the mean life of 
the component are given by 





where 7, is the accumulated life on test until the rth failure occurs, and 
hence 


r 


T, = iia 7, 
1 


l= 
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for nonreplacement tests and 
a a 


l=. 





if the test is with replacement. Note that if the test is without replacement 
and r = n, fi is simply the mean of the observed times to failure. 

To make inferences concerning the mean life 4 of the component, we 
use the fact that 27,/y is a value of a random variable having the chi-square 
distribution with 2r degrees of freedom (see reference to B. Epstein in the 
Bibliography). With the appropriate expression substituted for T ,» this is 
true regardless of whether the test is conducted with or without replace- 
ment. Thus, in either case a two-sided 1 — @ confidence interval for yu is 
given by 





oF pal & 
aif <— < r 
6) v Xi 


ee | 





where yj and 73 cut off left- and right-hand tails of area «/2 under the chi- 
square distribution with 2r degrees of freedom. (See Exercise 6 on page 
471.) 

Tests of the null hypothesis that w= yu, can also be based on the 
sampling distribution of 2T,/u, using the appropriate expression for T, 
depending on whether the test is with or without replacement. Thus, if 
the alternative hypothesis is u > Mo, we reject the null hypothesis at the 
level of significance « when 27,/, exceeds y?, or 





‘ > 4 Uo X2 





where v2, to be determined for 2r degrees of freedom, is as defined on 
page 176. In Exercises 2 and 4 on page 470 the reader will be asked to con- 
struct and perform similar tests corresponding to the alternative hypothe- 
ses UU < fl, and UF My. | 

An alternate life-testing procedure consists of discontinuing the test 
after a fixed accumulated amount of lifetime T has elapsed, and treating 
the observed number of failures k as the value of a random variable. (In 
the important special case where 7 items are tested with replacement for a 
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length of time ¢*, we have T = nt*.) Regardless of whether the test is with 
or without replacement, an approximate 1 — « confidence interval for the 
mean life of the component is given by 


oF AF 
Se Se 
XA eS 


Here y? cuts off a right-hand tail of area «/2 under the chi-square distribu- 


tion with 2k + 2 degrees of freedom, while 73 cuts off a left-hand tail of 
area o/2 under the chi-square distribution with 2k degrees of freedom. 


Example. To illustrate some of the methods presented in this section, 
let us consider the following example. Suppose that 50 units are placed 
on life test (without replacement) and that the test is to be truncated 
after r = 10 of them have failed. We shall suppose, furthermore, that 
the first 10 failure times are 65, 110, 380, 420, 505, 580, 650, 840, 910, 
and 950 hours. Thus, m = 50, r = 10, 


T,, = (65+ 110+... + 950) + (50 — 10)950 
= 43,410 hours 


43,410 
10 
hours. The failure rate a is estimated by 1/@ = 0.00023 failures per 
hour, or 0.23 failures per thousand hours. Also, a 0.90 confidence 

interval for yw is given by 


and we estimate the mean life of the component as @ = = 4,341 





2(43,410) 2(43,410) 
31.410 ~~ <~{0.851 


or 
2,764 < uw < 8,001 


Suppose it were also desired to use the above sample to test whether 
the failure rate is 0.40 failures per thousand hours against the alterna- 
tive that the failure rate is less. This is equivalent to a test of the null 
hypothesis 4 = 1,000/0.40 = 2,500 hours against the alternative that 
ut > 2,500 hours. Using a level of significance of 0.05, we find that the 
critical value for T,, for this test is given by 4(2,500)(31.410) = 39,263 
hours, and since this is exceeded by the observed value T,,) = 43,410, 
the null hypothesis must be rejected. We conclude that the mean life- 
time exceeds 2,500 hours, or equivalently, that the failure rate is less 
than 0.40 failures per thousand hours. 
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15.5. THE WEIBULL MODEL IN LIFE 
TESTING 


Although life testing of components during the period of useful life is 
generally based on the exponential model, we have already pointed out 
that the failure rate of a component may not be constant throughout a 
period under investigation. In some instances the period of initial failure 
may be so long that the component’s main use is during this period, and in 
other instances the main purpose of life testing may be that of determining 
the time to wear-out failure rather than chance failure. In such cases the 
exponential model generally does not apply, and it is necessary to substi- 
tute a more general assumption for that of a constant failure rate. 

As we observed on page 455, the Weibull distribution adequately 
describes the failure times of components when their failure rate either 
increases or decreases with time. It has the parameters «@ and £, its formula 
is given by 


f® =aBrte? t>0,a>0,8>0 


and it follows (see Exercise 11 on page 471) that the reliability function 
associated with the Weibull distribution is given by 





R(t) _ en uth 


We already showed on page 455 that the failure rate leading to the Weibull 
distribution is given by 


Z(t) = aftr! 


The range of shapes a graph of the Weibull density can take on is very 
broad, depending primarily on the value of the parameter f. As illustrated 
in Figure 15.4, the Weibull curve is asymptotic to both axes and highly 
skewed to the right for values of # less than 1; it is identical to that of the 
exponential density for 6 = 1, and it is “bell-shaped” but skewed for 
values of # greater than 1. 
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Figure 15.4. Weibull density functions (« = 1). 


The mean of the Weibull distribution having the parameters a and £ 
may be obtained by evaluating the integral 


= [tea BeP-te-e dt 
0 
Making the change of variable u = at’, we get 


f= ane | u'/Fe-* dy 
0 


Recognizing the integral asT'( 1 + $ , we find that the mean time to failure 
for the Weibull model is 





A 


w= ard + i 
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The reader will be asked to show in Exercise 12 on page 471 that the 
variance of this distribution is given by 





mma 2) L(+ Ml 


Estimates of the parameters a and £ of the Weibull distribution are 
somewhat difficult to obtain. Although there exist analytical methods for 
estimating these parameters, they involve the solution of a system of 
transcendental equations, and they will not be presented here. Instead, a 
more rapid and commonly used method, based on a graphical technique, 
will be described. This method is based on the fact that the reliability 
function of the Weibull distribution can be transformed into a linear func- 
tion of In ¢ by means of a double-logarithmic transformation. Taking the 
natural logarithm of R(t), we obtain 





In RY) = —at? or Ina. = aye 


R(t) 


Again taking logarithms, we have 





In Ing = Ing + B-Int 





and it can be seen that the right-hand side is linear in In ¢. 

To estimate « and £ we require estimates of R(t) for various values of f, 
and the usual procedure is to place n units on life test and observe their 
failure times. If the ith unit fails at time t,, we estimate F(t, = 1 — R(t,) 
by the same method as on page 143, namely, we use the estimator 


lon —_ 
FG) = 


Before going any further, it is customary to check whether it is actually 
reasonable to use the Weibull model. To this end, we plot points having 


the coordinates fr; and F (i) on special graph paper having its scales trans- 
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formed so that the divisions on the horizontal axis are proportional to In ¢ 

] 
[Fo If the 
points fall reasonably close to a straight line, it can be assumed that the 
underlying failure-time distribution is of the Weibull type. The parameters 
a and f of this distribution can then be estimated by applying the linear 
regression methods of Chapter 11 to fit a straight line to the transformed 
data. 


and those on the vertical scale are proportional to In In 


Example. To illustrate this procedure, let us consider the following 
numerical example. Suppose that a sample of 100 components is put 
on life test for 500 hours and that the times to failure of the 12 com- 
ponents that failed during the test are as follows: 6, 21, 50, 84, 95, 130, 
205, 260, 270, 370, 440, and 480 hours. These points are plotted in 
Figure 15.5 on the transformed scales previously described, and it can 
be seen that they fall fairly close to a straight line. 





Figure 15.5. Weibull failure-time distribution. 


The parameters « and f are estimated by applying the method of 
least squares to the transformed points (x;, y;), where 


Xi — In t; 
i 


y; = InIn —— 
| — FF) 


Thus, in our numerical example we obtain 
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F(t) t; Xi yi 

0.005 6 1.79 —5,30 
0.015 21 3.04 —4,20 
0.025 50 3.91 — 3.68 
0.035 84 4.43 — 3,33 
0.045 95 4.55 —3.08 
0.055 130 4.87 9:37 
0.065 205 5.32 —2.70 
0.075 260 5.56 —2.55 
0.085 270 5.60 —2.42 
0.095 370 5.91 —2.30 
0.105 440 6.09 —2.20 
0.115 480 6.17 9.10 


and, by the methods of Section 11.1, the regression line becomes 
y = —6.44 + 0.71x 
Note that in calculating the values of y, for the above table, it is con- 


venient to use the approximation In In 





i = = In z (see Exercise 13 on 


page 471) for small values of F(t). 

Thus, the parameter 8 of the underlying Weibull distribution is 
estimated as B = 0.71, and o is estimated as @ = e-°*4 = 0.0016. It 
follows that the mean time to failure is estimated as 


A = 0.0016)" F(1 + x71) 


which equals approximately 11,000 hours. Also, values of the failure- 
rate function may be obtained by substituting for ¢ into 


Ze) = (0.0016)(0.71)2-%-25 


Since B <1 the failure rate is decreasing with time. After one hour 
(¢ = 1), units are failing at the rate of (0.0016)(0.71) = 0.00114 units 
per hour, and after 1,000 hours the failure rate has decreased to 
(0.00114)(1000)-°:2° = 0.00015 units per hour. 


EXERCISES 


1. Suppose that 50 units are put on life test, each unit that fails is immediately 
replaced, and the test is discontinued after 8 units have failed. If the eighth 
failure occurred at 760 hours, 
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(a) construct a 0.95 confidence interval for the mean life of the units; 
(b) test at the 0.05 level of significance whether or not the mean life is less 
than 10,000 hours. 


2. In a life test with replacement, 35 space heaters were put into continuous 
operation, and the first five failures occurred after 250, 380, 610, 980, and 
1,250 hours. 

(a) Assuming the exponential model, construct a 0.99 confidence interval 
for the mean life of this kind of space heater. 

(b) To check the manufacturer’s claim that the mean life of these heaters is 
at least 5,000 hours, test the null hypothesis 4 = 5,000 against an appro- 
priate alternative, so that the burden of proof is put on the manufacturer. 
Use & = 0.05. 


3. Fifteen assemblies are put on accelerated life test without replacement, and 
the test is truncated after four failures. If the first four failures occurred at 
16.5, 19.2, 20.8, and 37.3 hours, 

(a) find a 0.90 confidence interval for the failure rate of these assemblies 
under these accelerated conditions; 

(b) test the null hypothesis that the failure rate is 0.004 failures per hour 
against the alternative that it is less than 0.004, using the 0.01 level of 
significance. 


4. To investigate the average time to failure of a certain weld subjected to con- 
tinuous vibration, seven welded pieces were subjected to specified frequencies 
and amplitudes of vibration and their times to failure were 211, 350, 384, 
510, 539, 620, and 715 thousand cycles. 

(a) Assuming the exponential model, construct a 0.95 confidence interval 
for the mean life (in thousands of cycles) of the weld under the given 
vibration conditions. 

(b) Assuming the exponential model, test the null hypothesis that the mean 
life of the weld under the given vibration conditions is 500,000 cycles 
against the two-sided alternative. Use a level of significance of 0.10. 


5. In life testing we are sometimes interested in establishing tolerance limits 
for the life of a component (see Section 14.5); in particular, we may be inter- 
ested in a one-sided tolerance limit ¢*, for which we can assert with a degree 
of confidence 1 — & that at least 100-P percent of the components have a 
life exceeding ¢*. Using the exponential model, it can be shown that a good 
approximation is given by 


— —2T,(n P) 
ae ¢ 


where T, is as defined on pages 462 and 463 and the value of 72 is to be 

obtained from Table 5 with 2r degrees of freedom. 

(a) Using the data of Exercise 2, establish a lower tolerance limit for which 
one can assert with a degree of confidence of 0.95 that it is exceeded by 
at least 80 percent of the lifetimes of the heaters. 
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11. 


12. 


13. 


15.5 The Weibull Model in Life Testing 


(b) Using the data of Exercise 4, establish a lower tolerance limit for which 
one can assert with a degree of confidence of 0.99 that it is exceeded by 
at least 90 percent of the lifetimes of the given welds. 


. Using the fact that 27;,/y is a value of a random variable having the chi- 


square distribution with 2r degrees of freedom, derive the confidence interval 
for ££ given on page 463. 


- One hundred devices are put on life test and the times to failure (in hours) 


of the first 10 that fail are as follows: 7.0, 14.1, 18.9, 31.6, 52.8, 80.0, 164.5, 
355.4, 451.0, and 795.1. Assuming a Weibull failure-time distribution, esti- 
mate the parameters & and f as well as the failure rate at 1,000 hours. How 
does this value of the failure rate compare with the value we would obtain if 


we assumed the exponential model? 


. A sample of 300 high-reliability capacitors was placed on life test for 2,000 


hours and there were no failures; the test was then terminated. Find a 0.95 
lower confidence limit for the mean life of the capacitors. 


. To investigate the performance of a logic circuit for a small electronic calcu- 


lator, a laboratory puts 75 of the circuits on life test (without replacement) 
under specified environmental conditions, and the first 10 failures are obser- 
ved after 28, 46, 50, 63, 81, 101, 116, 137, 159, and 175 hours. Using the 
Weibull model, estimate the mean life of the circuit. How does this value 
compare with the mean life that would have been obtained under the expon- 
ential assumption ? 


Using the estimates of the parameters of the Weibull model obtained in 
Exercise 9, estimate the probability that this kind of circuit will perform 
satisfactorily for at least 100 hours. 

Show that the reliability function associated with the Weibull failure-time 
distribution is given by 


R(t) = e- ath 


Derive the formula for the variance of the Weibull distribution given on 
page 467. 


Show that In In can be approximated by In z for small values of z. 





L—z 
[Hint: Note that 1 _ =1+2z2+27+ 23... for |z| <1, and then use 


the Maclaurin’s series for In (1 + x).] 
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Table 1 
BINOMIAL DISTRIBUTION FUNCTION 
x n 
F a k (1 —n)\n-k 
B(x; n, Dp) a, , |? (1 —p) 


n xX 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 0.45 0.50 


2 2 {| 0.9025 0.8100 0.7225 0.6400 0.5625 0.4900 0.4225 0.3600 0.3025 0.2500 
1 | 0.9975 0.9900 0.9775 0.9600 0.9375 0.9100 0.8755 0.8400 0.7975 0.7500 


3 O | 0.8574 0.7290 0.6141 0.5120 0.4219 0.3430 0.2746 0.2160 0.1664 0.1250 
0.9928 0.9720 0.9392 0.8960 0.8438 0.7840 0.7182 0.6480 0.5748 0.5000 


— 


2 | 0.9999 0.9990 0.9966 0.9920 0.9844 0.9730 0.9571 0.9360 0.9089 0.8750 
4 0 | 0.8145 0.6561 0.5220 0.4096 0.3164 0.2401 0.1785 0.1296 0.0915 0.0625 
1 | 0.9860 0.9477 0.8905 0.8192 0.7383 0.6517 0.5630 0.4752 0.3910 0.3125 
2 | 0.9995 0.9963 0.9880 0.9728 0.9492 0.9163 0.8735 0.8208 0.7585 0.6875 
3 | 1.0000 0.9999 0.9995 0.9984 0.9961 0.9919 0.9850 0.9744 0.9590 0.9375 
5 0 | 0.7738 0.5905 0.4437 0.3277 0.2373 0.1681 0.1160 0.0778 0.0503 0.0312 
1 | 0.9774 0.9185 0.8352 0.7373 0.6328 0.5282 0.4284 0.3370 0.2562 0.1875 
2 | 0.9988 0.9914 0.9734 0.9421 0.8965 0.8369 0.7648 0.6826 0.5931 0.5000 
3 {| 1.0000 0.9995 0.0078 0.9933 0.9844 0.9692 0.9460 0.9130 0.8688 0.8125 
4 1} 1.0000 1.0000 0.9999 0.9997 0.9990 0.9976 0.9947 0.9898 0.9815 0.9688 
6 O | 0.7351 0.5314 0.3771 0.2621 0.1780 0.1176 0.0754 0.0467 0.0277 0.0156 
1 | 0.9672 0.8857 0.7765 0.6554 0.5339 0.4202 0.3191 0.2333 0.1636 0.1094 
2 | 0.9978 0.9842 0.9527 0.9011 0.8306 0.7443 0.6471 0.5443 0.4415 0.3438 
3 | 0.9999 0.9987 0.9941 0.9830 0.9624 0.9295 0.8826 0.8208 0.7447 0.6562 
4 | 1.0000 0.9999 0.9996 0.9984 0.9954 0.9891 0.9777 0.9590 0.9308 0.8906 
5 | 1.0000 1.0000 1.0000 0.9999 0.9998 0.9993 0.9982 0.9959 0.9917 0.9844 
7 O | 0.6983 0.4783 0.3206 0.2097 0.1335 0.0824 0.0490 0.0280 0.0152 0.0078 
1 | 0.9556 0.8503 0.7166 0.5767 0.4449 0.3294 0.2338 0.1586 0.1024 0.0625 
2 | 0.9962 0.9743 0.9262 0.8520 0.7564 0.6471 0.5323 0.4199 0.3164 0.2266 
3 | 0.9998 0.9973 0.9879 0.9667 0.9294 0.8740 0.8002 0.7102 0.6083 0.5000 
4 | 1.0000 0.9998 0.9988 0.9953 0.9871 0.9712 0.9444 0.9037 0.8471 0.7734 
5 | 1.0000 1.0000 0.9999 0.9996 0.9987 0.9962 0.9910 0.9812 0.9643 0.9375 
6 | 1.0000 1.0000 1.0000 1.0000 0.9999 0.9998 0.9994 0.9984 0.9963 0.9922 
8 O | 0.6634 0.4305 0.2725 0.1678 0.1001 0.0576 0.0319 0.0168 0.0084 0.0039 
1 | 0.9428 0.8131 0.6572 0.5033 0.3671 0.2553 0.1691 0.1964 0.0632 0.0352 
2 | 0.9942 0.9619 0.8948 0.7969 0.6785 0.5518 0.4278 0.3154 0.2201 0.1445 
3 | 0.9996 0.9950 0.9786 0.9437 0.8862 0.8059 0.7064 0.594i 0.4770 0.3633 
4 | 1.0000 0.9996 0.9971 0.9896 0.9727 0.9420 0.8939 0.8263 0.7396 0.6367 
5 | 1.0000 1.0000 0.9998 0.9988 0.9958 0.9887 0.9747 0.9502 0.9115 0.8555 
6 | 1.0000 1.0000 1.0000 0.9999 0.9996 0.9987 0.9964 0.9915 0.9819° 0.9648 
7 | 1.0000 1.0000 1.0000 1.0000 1.0000 0.9999 0.9998 0.9993 0.9983 0.9961 
9 O | 0.6302 0.3874 0.2316 0.1342 0.0751 0.0404 0.0207 0.0101 0.0046 0.0020 
1 | 0.9288 0.7748 0.5995 0.4362 0.3003 0.1960 0.1211 0.0705 0.0385 0.0195 
2 | 0.9916 0.9470 0.8591 0.7382 0.6007 0.4628 0.3373 0.2318 0.1495 0.0898 
3 | 0.9994 0.9917 0.9661 0.9144 0.8343 0.7297 0.6089 0.4826 0.3614 0.2539 
4 | 1.0000 0.9991 0.9944 0.9804 0.9511 0.9012 0.8283 0.7334 0.6214 0.5000 
5 | 1.0000 0.9999 0.9994 0.9969 0.9900 0.9747 0.9464 0.9006 0.8342 0.7461 
6 | 1.0000 1.0000 1.0000 0.9997 0.9987 0.995% 0.9888 0.9750 0.9502 0.9102 
7 | 1.0000 1.0000 1.0000 1.0000 0.9999 0.9996 0.9986 0.9962 0.9909 0.9805 
8 | 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 0.9999 0.9997 0.9992 0.9980 
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0.5987 
0.9139 
0.9885 
0.9990 
0.9999 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


0.5688 
0.8981 
0.9848 
0.9984 
0.9999 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 


0.5404 
0.8816 
0.9804 
0.9978 
0.9998 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 


0.5133 
0.8646 
0.9755 
0.9969 
0.9997 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 


0.4877 
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0.3487 
0.7361 
0.9298 
0.9872 
0.9984 


0.9999 
1.0000 
1.0000 
1.0000 
1.0000 


0.3138 
0.6974 
0.9104 
0.9815 
0.9972 


0.9997 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 


0.2824 
0.6590 
0.8891 
0.9744 


‘0.9957 


0.9995 
0.9999 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 


0.2542 
0.6213 
0.8661 
0.9658 
0.9935 


0.9991 
0.9999 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 


0.2288 


0.1969 
0.5443 
0.8202 
0.9500 
0.9901 


0.9986 
0.9999 
1.0000 
1.0000 
1.0000 


0.1673 
0.4922 
0.7788 
0.9306 
0.9841 


0.9973 
0.9997 
1.0000 
1.0000 
1.0000 


1.0000 


0.1422 
0.4435 
0.7358 
0.9078 
0.9761 


0.9954 
0.2993 
0.9999 
1.0000 
1.0000 


1.0000 
1.0000 


0.1209 
0.3983 
0.6920 
0.8820 
0.9658 


0.9925 
0.9987 
0.9998 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 


0.1028 


0.3567 


0.1074 
0.3758 
0.6778 
0.8791 
0.9672 


0.9936 
0.9991 
0.9999 
1.0000 
1.0000 


0.0859 
0.3221 
0.6174 
0.8389 
0.9496 


0.9883 
0.9980 
0.9998 
1.0000 
1.0000 


1.0000 


0.0687 
0.2749 
0.5583 
0.7946 
0.9274 


0.9806 
0.9961 
0.9994 
0.9999 
1.0000 


1.0000 
1.0000 


0.0550 
0.2336 
0.5017 
0.7437 
0.9009 


0.9700 
0.9930 
0.9988 
0.9998 
1.0000 


1.0000 
1.0000 
1.0000 


0.0440 


0.0563 
0.2440 
0.5256 
0.7759 
0.9219 


0.9803 
0.9965 
0.9996 
1.0000 
1.0000 


0.0422 
0.1971 
0.4552 
0.7133 
0.8854 


0.9657 
0.9924 
0.9988 
0.9999 
1.0000 


1.0000 


0.0317 
0.1584 
0.3907 
0.6488 
0.8424 


0.9456 
0.9857 
0.9972 
0.9996 
1.0000 


1.0000 
1.0000 


0.0238 
0.1267 
0.3326 
0.5843 
0.7940 


0.9198 


0.27357 
0.9944 
0.9990 
0.9999 


1.0000 
1.0000 
1.0000 


0.0178 


0.0282 
0.1493 
0.3828 
0.6496 
0.8497 


0.9527 
0.9894 
0.9984 
0.9999 
1.0000 


0.0198 
0.1130 
0.3127 
0.5696 
0.7897 


0.9218 
0.9784 
0.9957 
0.9994 
1.0000 


1.0000 


0.0138 
0.0850 
0.2528 
0.4925 
0.7237 


0.8822 
0.9614 
0.9905 
0.9983 
0.9998 


1.0000 
1.0000 


0.0097 
0.0637 
0.2025 
0.4206 
0.6543 


0.8346 
0.9376 
0.9818 
0.9960 
0.9993 


0.9999 
1.0000 
1.0000 


0.0068 


0.0135 
0.0860 
0.2616 
0.5138 
0.7515 


0.9051 
0.9740 
0.9952 
0.9995 
1.0000 


0.0088 
0.0606 
0.2001 
0.4256 
0.6683 


0.8513 
0.9499 
0.9878 
0.9980 
0.9998 


1.0000 


0.0057 
0.0424 
0.1513 
0.3467 
0,5833 


0.7873 
0.9154 
0.9745 
0.9944 
0.9992 


0.9999 
1.0000 


0.0037 
0.0296 
0.1132 
0.2783 
0.5005 


0.7159 
0.8705 
0.9538 
0.9874 
0.9975 


0.9997 
1.0000 
1.0000 


0.0024 


0.0060 
0.0464 
0.1673 
0.3823 
0.6331 


0.8338 
0.9452 
0.9877 
0.9983 
0.9999 


0.0036 
0.0302 
0.1189 
0.2963 
0.5328 


0.7535 
0.9006 
0.9707 
0.9941 
0.9993 


1.0000 


0.0022 
0.0196 
0.0834 
0.2253 
0.4382 


0.6652 
0.8418 
0.9427 
0.9847 
0.9972 


0.9997 
1.0000 


0.0013 
0.0126 
0.0579 
0.1686 
0.3530 


0.5744 
OF712 
0.9023 
0.9679 
0.9922 


0.9987 
0.9999 
1.0000 


0.0008 


0.0025 
0.0232 
0.0996 
0.2660 
0.5044 


0.7384 
0.8980 
0.9726 
0.9955 
0.9997 


0.0014 
0.0139 
0.0652 
0.1911 
0.3971 


0.6331 
0.8262 
0.9390 
0.9852 
0.9978 


0.9998 


0.0008 
0.0083 
0.0421 
0.1345 
0.3044 


0.5269 


0.7393 


0.8883 
0.9644 
0.9921 


0.9989 
0.9999 


0.0004 
0.0049 
0.0269 
0.0929 
0.2279 


0.4268 
0.6437 
0.8212 
0.9302 
0.9797 


0.9959 
0.9995 
1.0000 


0.0002 


0.0010 
0.0107 
0.0547 
0.1719 
0.3770 


0.6230 
0.8281 
0.9453 
0.9893 
0.9990 


0.0005 
0.0059 
0.0327 
0.1133 
0.2744 


0.5000 
0.7256 
0.8867 
0.9673 
0.9941 


0.9995 


0.0002 
0.0032 
0.0193 
0.0730 
0.1938 


0.3872 
0.6128 
0.8062 
0.9270 
0.9807 


0.9968 
0.9998 


0.0001 
0.0017 
0.0112 
0.0461 
0.1334 


0.2905 
0.5000 
0.7095 
0.8666 
0.9539 


0.9888 
0.9983 
0.9999 


0.0001 
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0.05 


0.9699 
0.9958 
0.9996 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 


0.4633 
0.8290 
0.9638 
0.9945 
0.9994 


0.9999 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


0.4401 
0.8108 
0.9571 
0.9930 
0.9991 


0.9999 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 


0.4181 
0.7922 
0.9497 
0.9912 
0.9988 
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0.8416 
0.9559 
0.9908 


0.9985 
0.9998 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 


0.2059 
0.5490 
0.8159 
0.9444 
0.9873 


0.9978 
0.9997 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


0.1853 
0.5147 
0.7892 
0.9316 
0.9830 


0.9967 
0.999% 
0.9999 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 


0.1668 
0.4818 
0.7618 
0.9174 
OTT? 


0.6479 
0.8535 
0.9533 


0.9885 
0.9978 
0.9997 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 


0.0874 
0.3186 
0.6042 
0.8227 
0.9383 


0.9832 
0.9964 
0.9996 
0.9999 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


0.0743 
0.2839 
0.5614 
0.7899 
0.9209 


0.9765 
0.9944 
0.9989 
0.9998 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 


0.0631 
0.2525 
0.5198 
0.7556 
0.9013 


0.20 


0.4481 
0.6982 
0.8702 


0.9561 
0.9884 
0.9976 
0.9996 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 


0.0352 
0.1671 
0.3980 
0.6482 
0.8358 


0.9389 
0.9819 
0.9958 
0.9992 
0.9999 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


0.0281 
0.1407 
0.3518 
0.5981 
0.7982 


0.9183 
0.9733 
0.9930 
0.9985 
0.9998 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 


0.0225 
0.1182 
0.3096 
0.5489 
0.7582 


0.25 


0.2811 
0.5213 
0.7415 


0.8883 
0.9617 
0.9897 
0.9978 
0.9997 


1.0000 
1.0000 
1.0000 
1.0000 


0.0134 
0.0802 
0.2361 
0.4613 
0.6865 


0.8516 
0.9434 
0.9827 
0.9958 
0.9992 


0.9999 
1.0000 
1.0000 
1.0000 
1.0000 


0.0100 
0.0635 
0.1971 
0.4050 
0.6302 


0.8103 
0.9204 
0.9729 
(0).9925 
0.9984 


0.9997 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 


0.0075 
0.0501 
0.1637 
0.3530 
0.5739 


0.30 


0.1608 
0.3552 
0.5842 


0.7805 
0.9067 
0.9685 
O.9917 
0.9983 


0.9989 
1.0000 
1.0000 
1.0000 


0.0047 
0.0353 
0.1268 
0.2969 
0.5155 


0.7216 
0.8689 
0.9500 
0.9848 
0.9963 


0.9993 
O.9999 
1.0000 
1.0000 
1.0000 


0.0033 
0.0261 
0.0994 
0.2459 
0.4499 


0.6598 
0.8247 
0.9256 
0.9743 
0.2929 


0.9984 
0.9997 
1.0000 
1.0000 
1.0000 


1.0000 


0.0023 
0.0193 
0.0774 
0.2019 
0.3887 


0.35 


0.0839 
0.2205 
0.4227 


0.6405 
0.8164 
0.9247 
O97S7 
0.9940 


0.9989 
0.9999 
1.0000 
1.0000 


0.0016 
0.0142 
0.0617 
0.1727 
0.3049 


0.5643 
0.7548 
0.8868 
0.9578 
0.9876 


0.9972 
0.9995 
0.9999 
1.0000 
1.0000 


0.0010 
0.0098 
0.0451 
0.1339 
0.2892 


0.4900 
0.6881 
0.8406 
0.9329 
0.9771 


0.9938 
0.9987 
0.9998 
1.0000 
1.0000 


1.0000 


0.0007 
0.0067 
0.0327 
0.1028 
0.2348 


0.40 


0.0398 
0.1243 
0.2793 


0.4859 
0.6925 
0.8499 
0.9417 
0.9825 


0.9961 
0.9994 
0.9999 
1.0000 


0.0005 
0.0052 
0.0271 
0.0905 
0.2173 


0.4032 
0.6098 
0.7869 
0.9050 
0.9662 


0.9907 
0.9981 
0.9997 
1.0000 
1.0000 


0.0003 
0.0033 
0.0183 
0.0651 
0.1666 


0.3288 
0.5272 
0.7161 
0.8577 
0.9417 


0.9809 
0.9951 
0.9991 
0.9999 
1.0000 


1.0000 


0.0002 
0.0021 
0.0123 
0.0464 
0.1260 


0.45 


0.0170 
0.0632 
0.1672 


0.3373 
0.5461 
0.7414 
0.8811 
0.9574 


0.9886 
0.9978 
0.9997 
1.0000 


0.0001 
0.0017 
0.0107 
0.0424 
0.1204 


0.2608 
0.4522 
0.6535 
0.8182 
0.9231 


 OBT4S 


U99S7 
0.9989 
0.9999 
1.0000 


0.0001 
0.0010 
0.0066 
0.0281 
0.0853 


0.1976 
(0.3660 
0.5629 
0.7441 
0.8759 


().9514 
0.9851 
0.9965 
0.9994 
1.0000 


1.0000 


0.0000 
0.0006 
0.0041 
0.0184 
0.0596 


0.50 


0.0065 
0.0287 
0.0898 


0.2120 
0.3953 
0.6074 
0.7880 
0.9102 


0.9715 
O.9935 
0.9991 
0.9999 


0.0000 
0.0005 
0.0037 
0.0176 
0.0592 


0.1509 
0.3036 
0.5000 
0.6964 
0.8491 


0.9408 
0.9824 
0.9963 
0.9995 
1.0000 


0.0000 , 
0.0003 \\ 
0.0021 | 
0.0106 
0.0384 


0.1054 
0.2272 
0.4018 
0.5982, 
0.7728 


0.8949 | 
0.9616” 
0.9894-~ 
0.9979 
0.9997 


1.0000 


0.0000 
0.0001 
0.0012 
0.0063 
0.0245 
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0.05 


0.9999 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 


U.3972 
0.7735 
0.9419 
0.9891 
0.9985 


0.9998 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 


0.3774 
0.7547 
0.9335 
0.9868 
0.9980 


0.9998 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 


| 1.0000 


0.10 


0.9953 
0.9992 
0.9999 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 


0.1501 
0.4503 
0.7338 
0.9018 
0.9718 


0.9936 
0.9988 
0.9998 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 


0.1351 
0.4203 
0.7054 
0.8850 
0.9648 


0.9914 
0.9983 
0.9997 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 


BINOMIAL DISTRIBUTION FUNCTION (Continued) 


0.15 


0.9681 
0.9917 
0.9983 
0.9997 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 


0.0536 
0.2241 
0.4797 
0.7202 
0.8794 


0.9581 
0.9882 
0.9973 
0.9995 
0.9999 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 


0.0456 
0.1985 
0.4413 
0.6841 
0.8556 


0.9463 
0.9837 
0.9959 
0.9992 
0.9999 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 


0.20 


0.8943 
0.9623 
0.9891 
0.9974 
0.9995 


0.9999 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 


0.0180 
0.0991 
0.2713 
0.5010 
0.7164 


0.8671 
0.9487 
0.9837 
0.9957 
0.9991 


0.9998 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 


0.0144 
0.0829 
0.2369 
0.4551 
0.6733 


0.8369 
0.9324 
0.9767 
0.9933 
0.9984 


0.9997 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 


Table 1 


0.25 


0.7653 
0.8929 
0.9598 
0.9876 
0.9969 


0.9994 
0.9999 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 


0.0056 
0.0395 
0.1353 
0.3057 
0.5187 


0.7175 
0.8610 
0.9431 
0.9807 
0.9946 


0.9988 
0.9998 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 


0.0042 
0.0310 
0.1113 
0.2630 
0.4654 


0.6678 
0.8251 
0.9225 
0.9713 
O.9911 


0.9977 
0.9995 
0.9999 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 


0.5968 
0.7752 
0.8954 
0.9597 
0.9873 


0.9968 
0.9993 
0.9999 
1.0000 
1.0000 


1.0000 
1.0000 


0.0016 
0.0142 
0.0600 
0.1646 
0.3327 


0.5344 
0.7217 
0.8593 
0.9404 
0.9790 


0.9939 
0.9986 
0.9997 
1.0000 
1.0000 


1.0000 
1.0000 


0.0011 
0.0104 
0.0462 
0.1332 
0.2822 


0.4739 
0.6655 
0.8180 
0.9161 
().9674 


0.9895 
0.9972 
0.9994 
0.9999 
1.0000 


1.0000 
1.0000 
1.0000 


0.4197 
0.6188 
0.7872 
0.9006 
0.9617 


0.9880 
0.9970 
0.9994 
0.9999 
1.0000 


1.0000 
1.0000 


0.0004 
0.0046 
0.0236 
0.0783 
0.1886 


0.3550 
0.5491 
0.7283 
0.8609 
0.9403 


0.9788 
0.9938 
0.9986 
0.9997 
1.0000 


1.0000 
1.0000 


0.0003 
0.0031 
0.0170 
0.0591 
0.1500 


0.2968 
0.4812 
0.6656 
0.8145 
0.9125 


0.9653 
0.9886 
0.9969 
0.9993 
0.9999 


1.0000 
1.0000 
1.0000 


0.40 


0.2639 
0.4478 
0.6405 
0.8011 
0.9081 


0.9652 
0.9894 
0.9975 
0.9995 
0.9999 


1.0000 
1.0000 


0.0001 
0.0013 
0.0082 
0.0328 
0.0942 


0.2088 
0.3743 
0.5634 
0.7368 
0.8653 


0.9424 
0.9797 
0.9942 
0.9987 
0.9998 


1.0000 
1.0000 


0.0001 
0.0008 
0.0055 
0.0230 
0.0696 


0.1629 
0.3081 
0.4878 
0.6675 
0.8139 


0.9115 
0.9648 
0.9884 
0.9969 
0.9994 


0.9999 
1.0000 
1.0000 


0.45 


0.1471 
0.2902 
0.4743 
().6626 
0.8166 


0.9174 
0.9699 
0.9914 
0.9981 
0.9997 


1.0000 
1.0000 


0.0000 
0.0003 
0.0025 
0.0120 
0.0411 


0.1077 
0.2258 
0.3915 
0.5778 
0.7473 


0.8720 
0.9463 
0.9817 
0.9951 
0.9990 


0.9999 
1.0000 


0.0000 
0.0002 
0.0015 
0.0077 
0.0280 


0.0777 
0.1727 
0.3169 
0.4940 
0.6710 


0.8159 
0.9129 
0.9658 
0.9891 
0.9972 


0.9995 
0.9999 
1.0000 


0.0717 
0.1662 
0.3145 
0.5000 
0.6855 


0.8338 
0.9283 
0.9755 
0.9936 
0.9988 


0.9999 
1.0000 


0.0000 
0.0001 
0.0007 
0.0038 
0.0154 


0.0481 
0.1189 
0.2403 
0.4073 
0.5927 


O.7597 
0.8811 
0.9519 
0.9846 
0.9962 


0.9993 
0.9999 


0.0000 
0.0000 
0.0004 
0.0022 
0.0096 


0.0318 
0.0835 
0.1796 
0.3238 
0.5000 


0.6762 
().8204 
0.9165 
().9682 
0.9904 


0.9978 
().9996 
1.0000 
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0.3585 
0.7358 
0.9245 
0.9841 
0.9974 


0.9997 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 


Table 1 


BINOMIAL DISTRIBUTION FUNCTION (Continued) 


0.1216 
0.3917 
0.6769 
0.8670 
0.9568 


0.9887 
0.9976 
0.9996 
0.9999 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 


0.0388 
0.1756 
0.4049 
0.6477 
0.8298 


0.9327 
0.9781 
0.9941 
0).9987 
0.9998 


1.0000 
1.0000 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 


0.20 


0.0115 
0.0692 
0.2061 
0.4114 
0.6296 


0.8042 
0.9133 
0.9679 
0.9900 
0.9974 


0.9994 
0.9999 
1.0000 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 


0.25 


0.0032 
0.0243 
0.0913 
0.2232 
0.4148 


0.6172 
0.7858 
0.8982 
().9591 
0.9861 


0.9961 
0.9991 
0.9998 
1.0000 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 


0.30 


0.0008 
0.0076 
0.0355 
0.1071 
0.2375 


0.4164 
0.6080 
0.7723 
0.8867 
0.9520 


0.9829 
0.9949 
0.9987 
0.9997 
1.0000 


1.0000 
1.0000 
1.0000 
1.0000 


0.35 


0.0002 
0.0021 
0.0121 
0.0444 
0.1182 


0.2454 
0.4166 
0.6010 
().7624 
0.8782 


0.9468 
0.9804 
0.9940 
0.9985 
0.9997 


1.0000 
1.0000 
1.0000 
1.0000 


0.40 


0.0000 
0.0005 
0.0036 
0.0160 
0.0510 


0.1256 
0.2500 
0.4159 
().5956 
0.7553 


0.8725 
0.9435 
0.9790 
0.9935 
0.9984 


0.9997 
1.0000 
1.0000 
1.0000 


0.45 


0.0000 
0.0001 
0.0009 
0.0049 
0.0189 


0.0553 
0.1299 
0.2520 
0.4143 
0.5914 


0.7507 
0.8692 
(1.9420 
0.9786 
0.9936 


0.9985 
0.9997 
1.0000 
1.0000 


0.50 


0.0000 
0.0000 
0.0002 
0.0013 
0.0059 


0.0207 
0.0577 
0.1316 
0).2517 
0.4119 


0.5881 
0.7483 
0.8684 
0.9423 
OS793 


0.9941 
0.9987 
0.9998 
1.0000 
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Table 2 


POISSON DISTRIBUTION FUNCTION* 





*Reprinted by kind permission from E. C. Molina, Poisson’s Exponential Binomial 
Limit, D. Van Nostrand Company, Inc., Princeton, N.J., 1947. 
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Table 2 


POISSON DISTRIBUTION FUNCTION (Continued) 


ESTEE ESE EAR EZEAES ESE 





2 1.000 
7 0.999 | 1.000 
2.6 0.999 | 1.000 
2.8 0.998 | 0.999 
3.0 0.996 | 0.999 
3.2 0.994 | 0.998 
3.4 0.992 | 0.997 
3.6 0.988 | 0.996 
3.8 0.984 | 0.994 
4.0 0.979 | 0.992 
4.2 0.972 | 0.989 
4.4 0.964 | 0.985 
4.6 0.955 | 0.980 
4.8 0.944 | 0.975 
5.0 0.932 | 0.968 
5.2 0.918 | 0.960 
5.4 0.903 | 0.951 
5.6 0.886 | 0.941 
5.8 0.867 | 0.929 
6.0 0.847 | 0.916 
2.8 
3.0 
3.2 
3.4 
3.6 
3.8 
4.0 
4.2 
4.4 
4.6 
4.8 
5.0 
5.2 
5.4 
5.6 1.000 
5.8 1.000 
6.0 0.999 | 1.000 
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Table 2 


POISSON DISTRIBUTION FUNCTION (Continued) 


bc ES ERE EE EE EE 


0.002 0.015 0.054 ().259 ().414 0.574 ().716 ().826 (1.902 
















a 0.002 0.012 ().046 ().235 ().384 ().542 ().687 0.803 (0.886 
6.6 (0.001 ().010 ().040 0.213 (1.355 OST ().658 ().780 ().869 
6.8 0.001 ().009 ().034 0.492 (32) (0.480 | 0.628 ().755 (0.850 

7.0 0.001 0.007 ().030 0.173 0.301 ().450 ().599 ().729 ().830 

72 0.001 0.006 0.025 0.156. ().276 0.420 ().569 0.703 0.810 
74 0.001 ().005 0.022 0.140 ().253 0.392 0.539 ().676 ().788 
7.6 0.001 ().004 ().019 ().125 ().231 ().365 0.510 ().648 ().765S 
78 0.000 0.004 ().016 ().112 ().2.10 ().338 ().481 ().620 ().741 
8.0 0.000 0.003 ().014 0.100 0.191 0.313 ().453 ().593 ().717 
8.5 0.000 0.002 0.009 0.074 0.150 ().256 0).386 0.523 ().653 
9.0 0).000 0.001 0.006 0.055 ().116 0.207 ().324 ().456 ().S87 
9.5 0.000 0.001 0.004 0.040 0.089 ().165 ().269 ().392 0.522 
10.0 0.000 0.000 0.003 ) 0.029 0.067 0.130 0.220 ().333 0.458 

9 

6.2 ; ? 0.998 1.000 

6.4 : : 0.997 1.000 

6.6 . ; ().997 ().999 1.000 

6.8 ; : 0.996 0.999 1.000 

7.0 : : 0.994 0.999 1.000 

a2 : : 0.993 0.999 ().999 1.000 

7.4 : ‘ y 0.991 ().998 ().999 1.000 

7.6 ; ; ().989 ().998 ().999 1.000 

7.8 . : ().986 ().997 ().999 1.000 

8.0 : : 0.983 ().996 (1.998 0.999 1.000 
8.5 : h 0.973 ().993 ().997 0.999 (1.999 
9.0 ; ; ().959 0).989 ().995 (1.998 0.999 
9.5 ; . 0.940 ().982 ().991 (1.996 ().998 
10.0 : y 0.917 ().973 0.986 0.993 ().997 
8.5 1.000 

9.0 1.000 

9.5 0.999 1.000 


10.0 0.998 0.999 1.000 
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Table 2 


POISSON DISTRIBUTION FUNCTION (Continued) 


oS SE 





10.5 0.397 

11.0 0.341 

11.5 0.289 

12.0 0.155 _| 0.242 

12.5 0.201 

13.0 0.166 
\ 13.5 0.135 

14.0 0.109 

14.5 0.088 

15.0 0.070 

19 

10.5 0.994 

11.0 0.991 

11.5 0.986 

12.0 0.979 

12.5 0.969 

13.0 0.957 

13.5 0.942 

14.0 0.923 

14.5 0.901 

15.0 0.875 

29 

10.5 

11.0 

11.5 

12.0 

12.5 

13.0 

13.5 

14.0 1.000 

14.5 ; 0.999 | 1.000 


15.0 9S 0.999 | 1.000 
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Table 2 


POISSON DISTRIBUTION FUNCTION (Continued) 


Pat Se EE 


































































().000 0.004 ().010 0.022 0.077 0.275 
. 0.000 0.002 0.005 0.013 ().049 0.201 
18 0.000 0.001 0.003 0.007 0.030 0.143 
19 ().000 0.001 0.002 0.004 ().018 0.098 
20 ().000 (0).( 00 0.001 0.002 ().011 0.066 
21 ().000 0.000 0.000 0.001 0.006 0.043 
22 0.000 ().000 ().000 0.001 0.004 0.028 
23 0.000 0.000 0.000 0.000 0.002 0.017 
24 0.000 0.000 0.000 0.000 0.001 0.011 
25 0.001 0.006 
23 
16 0.963 
17 0.937 
18 0.899 
19 0.849 
20 0.787 
21 0.716 
22 0.637 
23 0.555 
24 0.473 
25 0.394 
53 
16 
17 
18 1.000 
19 0.999 
20 0.997 
21 0.994 
22 0.989 
23 0.981 
24 0.969 
25 0.950 
g 43 
19 
20 
21 
22 
23 1.000 
74 0.999 0.999 
25 0.998 0.999 





487 


Z 


0.0 
0.1 
0.2 
0.3 
0.4 


0.5 
0.6 
0.7 
0.8 
0.9 


1.0 
1.1 
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Table 3 


NORMAL DISTRIBUTION FUNCTION 


_> t? 
F(z) = _ a 
V271 J -© 


0.00 0.01 0.02 0.03 0.04 0.05 0.06 


0.5000 0.5040 0.5080 0.5120 0.5160 0.5199 0.5239 
0.5398 0.5438 0.5478 0.5517 0.5557 0.5596 0.5636 
0.5793 0.5832 0.5871 0.5910 0.5948 0.5987 0.6026 
0.6179 0.6217 0.6255 0.6293 0.6331 0.6368 0.6406 
0.6554 0.6591 0.6628 0.6664 0.6700 0.6736 0.6772 


0.6915 0.6950 0.6985 0.7019 0.7054 0.7088 0.7123 
0.7257. 0.7291 0.7324 0.7357 0.7389 0.7422 0.7454 
0.7580 0.7611 0.7642 0.7673 0.7704 0.7734 0.7764 
0.7881 0.7910 0.7939 0.7967 0.7995 0.8023 0.8051 
0.8159 0.8186 0.8212 0.8238 0.8264 0.8289 0.8315 


0.8413 0.8438 0.8461 0.8485 0.8508 0.8531 0.8554 
0.8643 0.8665 0.8686 0.8708 0.8729 0.8749 0.8770 


1.2 | 0.8849 0.8869 0.8888 0.8907 0.8925 /0.8944/ 0.8962 


Lo 
1.4 


LS 
1.6 
L.7 
1.8 


0.9032 0.9049 0.9066 0.9082 0.9099 0.9115 0.9131 
0.9192 0.9207 0.9222 0.9236 0.9251 ‘0.9265 0.9279 


0.9332 0.9345 0.9357 0.9370 0.9382 0.9394 0.9406 
0.9452 0.9463 0.9474 0.9484 0.9495 0.9505 0.9515 
0.9554 0.9564 0.9573 0.9582 0.9591 0.9599 0.9608 
0.9641 0.9649 0.9656 0.9664 0.9671 _ 0.9678 0.9686 __ 





1.9 | 0.9713 0.9719 0.9726 0.9732 0.9738 0.9744 0.9750 


2.0 
2.1 
pee 
aud 
2.4 


2 
2.6 
| 
2.8 
29 


3.0 
a1 
3.2 
3.3 
3.4 


0.9772 0.9778 0.9783 0.9788 0.9793 0.9798 | 0.9803 
0.9821 0.9826 0.9830 0.9834 0.9838 0.9842 0.9846 
0.9861 0.9864 0.9868 0.9871 0.9875 0.9878 0.9881 
0.9893 0.9896 0.9898 0.9901 0.9904 0.9906 0.9909 
0.9918 0.9920 0.9922 0.9925 0.9927 0.9929 0.9931 


0.9938 0.9940 0.9941 0.9943 0.9945 0.9946 0.9948 
0.9953 0.9955 9.9956 0.9957 0.9959 0.9960 0.9961 
0.9965 0.9966 0.9967 0.9968 0.9969 0.9970 0.9971 
0.9974 0.9975 0.9976 0.9977 0.9977 0.9978 0.9979 
0.9981 0.9982 0.9982 0.9983 0.9984 0.9984 0.9985 


0.9987 0.9987 0.9987 0.9988 0.9988 0.9989 0.9989 
0.9990 0.9991 0.9991 0.9991 0.9992 0.9992 0.9992 
0.9993 0.9993 0.9994 0.9994 0.9994 0.9994 0.9994 
0.9995 0.9995 0.9995 0.9996 0.9996 0.9996 0.9996 
0.9997 0.9997 0.9997 0.9997 0.9997 0.9997 0.9997 


0.07 


0.5279 
0.5675 
0.6064 
0.6443 
0.6808 


0.7157 
0.7486 
0.7794 
0.8078 
0.8340 


0.8577 
0.8790 
0.8980 
0.9147 
0.9292 


0.9418 
0.9525 
0.9616 
0.9693 
0.9756 


0.9808 
0.9850 
0.9884 
OS911 
0.9932 


0.9949 
0.9962 
0.9972 
0.9979 
0.9985 


0.9989 
0.9992 
0.9995 
0.9996 
0.9997 


0.08 


0.5319 
0.5714 
0.6103 
0.6480 
0.6844 


0.7190 
0.7517 
0.7823 
0.8106 
0.8365 


0.8599 
0.8810 
0.8997 
0.9162 
0.9306 


0.9429 
0.9535 
0.9625 
0.9699 
0.9761 


0.9812 
0.9854 
0.9887 
0.9913 
0.9934 


0.9951 
0.9963 
0.9973 
0.9980 
0.9986 


0.9990 
0.9993 
0.9995 
0.9996 
0.9997 


0.09 


N5552 
0.5753 
0.6141 
0.6517 
0.6879 


0.7224 
0.7549 
0.7852 
0.8133 
0.8389 


0.8621 
0.8830 
0.9015 
0.9177 
0.9319 


0.9441 
0.9545 
0.9633 
0.9706 
0.9767 


0.9817 
0.9857 
0.9890 
0.9916 
0.9936 


0.9952 
0.9964 
0.9974 
0.9981 
0.9986 


0.9990 
0.9993 
0.9995 
0.9997 
0.9998 
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Table 4 


Values of t,* 


a= 0.025 





*This table is abridged from Table IV of R. A. Fisher, Statistical Methods for Research Workers, 
published by Oliver and Boyd, Ltd., Edinburgh, by permission of the author and publishers. 
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Table 5 


VALUES OF x2* 


a@=0.975 a@=0.95 @=0.05 a@=0.025 a=0.01 


0.0000393 0.000157 0.000982 0.00393 3.841 
0.0100 0.0201 0.0506 0.103 5.991 
0.0717 0.115 0.216 0.352 7.815 
0.207 0.297 0.484 0.711 9.488 
0.412 0.554 0.831 1.145 11.070 


0.676 0.872 1.237 1.635 12.592 
0.989 1239 1.690 2.167 14.067 
1.344 1.646 2.180 2.733 15.507 
1.735 2.088 2.700 We 16.919 
2.156 2.598 3.247 3.940 18.307 


2.603 3.053 3.816 4.575 19.675 
3.074 S.071 4.404 5.226 21.026 
3.565 4.107 5.009 5.892 22.362 
4.075 4.660 5.629 6.571 23.685 
4.601 5.229 6.262 7.261 24.996 


5.142 5.812 6.908 7.962 26.296 
5.697 6.408 7.564 8.672 27.587 
6.265 7.015 8.231 9.390 28.869 
6.844 _ 7.633 8.907 10.117 30.144 
7.434 8.260 9591 10.851 31.410 


8.034 8.897 10.283 11,591 32.671 
8.643 9.542 10.982 12.338 33.924 
9.260 10.196 11.689 13.091 3o.L72 
9.886 10.856 12.401 13.484 36.415 
10.520 11.524 13.120 14.611 37.652 


11.160 12.198 13.844 15.379 38.885 
11.808 12.879 14.573 16.151 40.113 
12.461 13.565 15.308 16.928 41.337 
13.121 14.256 16.047 17.708 42.557 
13.787 14.953 16.791 18.493 43.773 





*This table is based on Table 8 of Biometrika Tables for Statisticians, Vol. 1, by permission of 
the Biometrika trustees. 
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492 Statistical Tables 


Table 7 


RANDOM DIGITS* 


1306 1189 5731 3968 5606 5084 8947 3897 1636 7810 
0422 2431 0649 8085 5053 4722 6598 5044 9040 §121 
6597 2022 6168 5060 8656 6733 6364 7649 1871 4328 
7965 6541 5645 6243 7658 6903 9911 5740 71824 8520 
7695 6937 0406 8894 0441 8135 9797 7285 5905 9539 
5160 7851 8464 6789 3938 4197 6511 0407 9239 2232 
2961 0551 0539 8288 1478 7565 5581 5771 5442 8761 
1428 4183 4312 5445 4854 9157 9158 5218 1464 3634 
3666 5642 4539 1561 7849 7520 2547 0756 1206 2033 
6543 6799 7454 9052 6689 1946 2574 9386 0304 71945 
9975 6080 7423 3175 9377 6951 6519 8287 8994 5532 
4866 0956 7545 7723 8085 4948 2228 9583 4415 7065 
8239 7068 6694 5168 3117 1568 0237 6160 9585 1133 
8722 9191 3386 3443 0434 4586 4150 1224 6204 0937 
1330 9120 8785 8382 2929 7089 3109 6742 2468 7025 
2296 2952 4764 9070 6356 9192 4012 0618 2219 1109 
3582 7052 3132 4519 9250 2486 0830 8472 2160 7046 
5872 9207 7222 6494 8973 3545 6967 8490 5264 9821 
1134 6324 6201 3792 5651 0538 4676 2064 0584 7996 
1403 4497 7390 8503 8239 4236 8022 2914 4368 4529 
3393 7025 3381 3553 2128 1021 8353 6413 5161 8583 
1137 7896 3602 0060 7850 1626 0854 6565 4260 6220 
7437 5198 8772 6927 8527 6851 2709 5992 7383 1071 
8414 8820 3917 7238 9821 6073 6658 1280 9643 7761 
8398 5224 2749 7311 5740 9771 7826 9533 3800 4553 
0995 8935 2939 3092 2496 0359 0318 4697 7181 4035 
6657 0755 9685 4017 6581 7292 5643 5064 1142 1297 
8875 8369 7868 0190 9278 1709 4253 9346 4335 3769 
8399 6702 0586 6428 7985 2979 4513 1970 1989 3105 
6703 1024 2064 0393 6815 8502 1375 4171 6970 1201 
4730 1653 9032 9855 0957 7366 0325 5178 7959 5371 
8400 6834 3187 8688 1079 1480 6776 9888 7585 9998 
3647 8002 6726 0877 4552 3238 7542 7804 3933 9475 
6789 5197 8037 2354 9262 5497 0005 3986 1767 7981 
2630 2721 2810 2185 6323 5679 4931 8336 6662 3566 
1374 8625 1644 3342 1587 0762 6057 8011 2666 3759 
1519 7625 9110 4409 0239 7059 3415 5537 2250 7292 
9678 2877 1579 4935 0449 8119 6969 5383 L717 6719 
0882 6781 3538 4090 3092 2365 6001 3446 9985 6007 
0006 4205 2389 4365 1981 8158 1784 6256 3842 5603 
4611 9861 7916 9305 2074 9462 0254 4827 9198 3974 
1093 3784 4190 6332 1175 8599 9735 8584 6581 7194 
3374 3545 6865 8819 3342 1676 2264 6014 5012 2458 
3650 9676 1436 4374 4716 5548 8276 6235 6742 2154 
7292 5749 71977 7602 9205 3599 3880 9537 4423 2330 
2353 8319 2850 4026 3027 1708 3518 7034 7132 6903 
1094 2009 8919 5676 7283 4982 9642 9235 8167 3366 
0568 4002 0587 7165 1094 2006 7471 0940 4366 9554 
5606 4070 5233 4339 6543 6695 5799 58?" 3953 9458 
8285 7537 1181 2300 5294 6892 1627 3372 1952 3028 


*From Donald B. Owen, Handbook of Statistical Tables. Reading, Mass.: Addison Wesley, 1962. 
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2444 
5748 
7761 
6838 
6440 


8829 
9845 
5072 
9035 
5562 


2757 
6397 
9208 
2418 
7300 


6870 
2914 
0868 
7994 
8587 


8016 
5581 
2015 
127 
3589 


2188 
2924 
1961 
293 
7585 


0197 
9719 
8866 
5744 
7149 


7710 
5246 
3576 
3026 
6591 


0402 
7105 
7181 
3655 
$121 


3618 
0166 
6187 
1234 
8949 


9039 
7767 
8583 
9280 
2631 


0094 
4796 
9000 
0553 
1080 


2650 
1848 


7641 | 


9289 
9006 


2535 
7309 
8657 
6579 
0764 


8345 
5775 
3364 
8987 
1991 


6454 
0472 
1669 
9637 
4424 


4021 
5336 
4785 
S957 
5721 


8479 
5651 
0013 
1464 
4365 


3257 
7654 
4140 
3282 
6717 


3098 
3638 
9653 
1448 
8731 


4803 
2800 
0852 
2654 
3033 


2887 
2951 
3887 
| Bs 
2222 


8727 
1476 
3575 
6120 
5659 


8916 
4045 
8118 
6461 
6687 


2257 
7517 
6688 
5741 
1764 


7219 
9878 
2219 
0410 
2648 


9207 
soa2 
6007 
8786 
1389 


9345 
0432 
7381 
2671 
8717 


0480 
4745 
1046 
2178 
3117 


9208 
4947 
3658 
0276 
4875 


Table 7 


RANDOM DIGITS (Continued) 


8568 
6289 
5619 
0812 
9167 


3802 
4449 
5739 
2600 
0336 


3953 
0787 
4279 
8141 
8258 


3245 
Bey 
6340 
2292 
9150 


5084 
9076 
2631 
0102 
8355 


1215 
7966 
1113 
7536 
6728 


Ee a 
8537 
8006 
9023 
6581 


773 
8644 
0124 
4691 
2365 


5038 
4482 
0885 
8134 
1901 


7429 
1414 
4730 
7290 
5724 


1590 
2814 
6864 
3988 
4998 


5497 
1999 
7920 
3828 
1411 


9579 
4990 
1282 
3908 
3662 


2256 
3195 
9452 
9554 
1642 


8004 
4699 
2152 
1173 
9684 


0473 
2491 
9175 
0972 
2233 


9212 
2980 
9043 
1472 
7196 


9086 
6341 
8559 
0353 
5686 


1998 
8471 
1264 
3291 
5184 


1578 
4799 
1652 
1667 
2962 


2420 
8281 
8506 
2146 
7036 


0318 
2691 
6074 
8197 
0303 


2442 
4666 
1840 
eee i 
0332 


4350 
4166 
7460 
8309 
2050 


7949 
8313 
2220 
7350 
9423 


6589 
5662 
0260 
ALS 
3518 


7017 
8252 
4109 
7275S 
7072 


1202 
9661 
9813 
5289 
8377 


(2935 


1424 
7755 
7262 
6467 


1917 
9189 
8096 
2823 
1182 


2547 
1549 
9643 
5095 
0133 


5168 
5328 
4715 
8852 
7424 


8041 
1208 
5999 
3590 
5369 


6064 
0878 
3291 
5036 
4934 


3205 
8401 
1637 
7080 
7101 


2355 
5635 
4046 


0708 


7267 


8060 
4971 
5570 
1014 
6360 


8845 
2361 
9080 
8754 
8675 


1306 
2031 
1653 
8229 
8954 


7927 
2459 
8288 
3755 
2930 


2470 
9319 
7763 
0150 
7428 


6377 
7674 
3681 
9092 
3713 


9869 
2769 
1806 
2317 
3640 


2438 
5184 
5778 
0974 
0027 


3972 
7147 
8333 
7420 
1063 


9579 
2789 
8142 
1935 
1732 


6216 
0110 
9249 
1104 
3084 


3163 
8377 
6984 
2442 
9798 


1190 
7803 
8924 
2866 
3884 


2696 
5056 
9368 
5642 
7539 


8179 
3341 
9611 
8043 
9702 


9216 
7004 
vg | 
8027 
9278 


2887 
3922 
7809 
8975 
0563 


2002 
6680 
1167 
goL7 
1376 


7640 
9416 
4838 
1847 
4151 


7004 
2564 
4432 
1143 
1926 


1942 
6209 
9905 
0832 
7009 


T9337 
8673 
0926 
1799 
7745 


2406 
4367 
5822 
7065 
0279 


3704 
5982 
5531 
4854 
4526 


4617 
1192 
1289 
9079 
1376 


2802 
6212 
2701 
6144 
1918 


3933 
1158 
5885 
4593 
7939 


1272 
2655 
0312 
8293 
5040 


3478 
7184 
5699 
0741 
4875 


6209 
1249 
2664 
1704 
3833 


6817 
1556 
2132 
7680 
0239 


6163 
6098 
2169 
8983 
6360 


2596 
6816 
4401 
4806 
8635 


0833 
6154 
7788 
8844 
7252 
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4357 
5339 
6583 
6564 
4811 


6931 
8755 
6742 
6655 
8514 


8135 
4414 
3727 
5434 
7195 


2705 
1547 
3424 
8969 
S225 


6432 
3085 
0264 
8710 
5736 


7529 
5133 
3170 
3024 
4398 


0082 
4351 
3268 
4391 
7328 


3835 
8731 
2995 
5597 
3081 


7406 
5969 
4765 
3219 
6906 


7993 
2549 
3672 
p8 W 
3162 


4146 
7325 
8433 
3526 
1933 


7236 
3390 
2260 
3930 
4806 


5004 
6855 
7959 
7342 
8828 


8245 
8981 
1435 
7551 
8720 


9861 
5903 
1246 
2419 
9001 


L339 
7993 
9915 
0680 
3121 


5419 
6516 
0086 
8487 
0705 


2938 
4980 
7868 
9028 
5876 


4439 
9442 
9647 
2532 
8859 


3141 
3737 
7033 
0293 
9968 


8353 
6862 
0717 
2171 
3763 


1230 
6120 
3443 
9014 
4124 


7299 
0127 
5056 
0314 
9869 


6251 
4972 
1354 
3695 
8898 


1516 
8319 
3687 
6065 
3132 


4802 
8030 
6960 
1127 
7749 


7659 
6814 
7580 
4884 
0652 


2671 
8674 
0683 
5660 
8150 


5683 
7696 
4364 
7577 
5044 


0103 
7686 
4844 
3978 
6369 


Table 7 


RANDOM DIGITS (Continued) 


9952 
1584 
0606 
3809 
6265 


0441 
7825 
0190 
6032 
9286 


8981 
5489 
5983 
7525 
2785 


9611 
1280 
7631 
4915 
2478 


2849 
2744 
9159 
0036 
4521 


5751 
7408 
2621 
8088 
8191 


2061 
5898 
1337 
1488 
9424 


4691 
4506 
3768 
5006 
1360 


6877 
7510 
1037 
2815 
8826 


4528 
0723 
0149 
5933 
1258 


8004 
8634 
9284 
3428 
8931 


4013 
9005 
9278 
7574 
0449 


4689 
5157 
8021 
0067 
3186 


1077 
4286 
7260 
7921 
3342 


2539 
0814 
6995 
9650 
9973 


3785 
2186 
6718 
0200 
2087 


2506 
3973 
3884 
2249 
7082 


0559 
7262 
0625 
8325 
1868 


2920 
1620 
4975 
8696 
6218 


7988 
4505 
7412 
1032 
0416 


7945 
3485 
2759 
5523 
0649 


1352 
7012 
1816 
1685 
5051 


1950 
6386 
0204 
2800 
8375 


0641 
5678 
7361 
2913 
9200 


2208 
7318 
6565 
2027 
5070 


7125 
0725 
4059 
5868 
8270 


7573 
8103 
5679 
6661 
8579 


8382 
8127 
9887 
9677 
9265 


9588 
4973 
1998 
9248 
3206 


4635 
6841 
6370 
5192 
4326 


1530 
2278 
1888 
9078 
8085 


6563 
1643 
7697 
5258 
4772 


ged h 
7492 
7616 
6292 
7414 


0195 
0338 
0151 
3840 
8836 


4595 
8619 
3949 
6042 
8078 


4922 
5554 
9919 
0084 
$233 


Liss 
3616 
4830 
5774 
5647 


2825 
2022 
7060 
2169 
3217 


3002 
1911 
1359 
9410 
9034 


8478 
1379 
1884 
1732 
7840 


5207 
5832 
2889 
0648 
6177 


1499 
9934 
7933 
3100 
4651 


2201 
3736 
4325 
4706 
7232 


7024 
8096 
8903 
9031 
7269 


8616 
7614 
1012 
5467 
4150 


8877 
5664 
1007 
6362 
3980 


3979 
2049 
4509 
7205 
5571 


4928 
2178 
0514 
3196 
8465 


2869 
1288 
1346 
9282 
0843 


9094 
6460 
0717 
2137 
6525 


4730 
0612 
0285 
7768 
4450 


7332 
4044 
0067 
5358 
0038 


8344 
7164 
7454 
3454 
0401 


6202 
8284 
9056 
9747 
2992 


6170 
3265 
0179 
1839 
2276 


9530 
6791 
6469 
6808 
6774 


2309 
7843 
9587 
2717 
9667 


S379 
7463 
0034 
0357 
7502 


3746 
6160 
6125 
6572 
9832 


9077 
1869 
5740 
9357 
2608 


1967 
8118 
2765 
3326 
2139 


3068 
7022 
2906 
1929 
1580 


3852 
0498 
5039 
6881 
2483 


3899 
7010 
8684 
9735 
6284 


5865 
3999 
0059 
eet i | 
5059 


6499 
9677 
5410 
3727 
8522 


0811 
0568 
2184 
7030 
8555 


8635 
4842 
8600 
7811 
6458 


3690 
9797 
5078 
3940 
2703 


5306 
5700 
8477 
5941 
S250 
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1758 
6430 
4893 
1516 
4950 


0549 
1018 
2241 
1602 
5840 


1676 
6048 
5549 
5317 
pa A 


2300 
1499 
6201 
6839 
0092 


1862 
9886 
5289 
2685 
6055 


4092 
5951 
7687 
9886 
3193 


9181 
9459 
9874 
7729 
4699 


1872 


9636: 


6403 
4433 
2361 


4077 
6678 
6499 
6663 
9999 


9048 
5136 
9906 
4198 
2030 


1489 
8803 
8857 
2733 
3171 


6775 
7027 
9965 
0708 
8381 


0367 
4175 
9621 
4584 
7784 


5412 
8699 
3791 
9736 
Io 


32553 
6744 
9071 
7104 
7092 


4995 
4937 
3849 
3463 
3011 


7365 
2175 
5278 
2099 
8102 


7244 
0123 
9054 
3293 
3933 


8463 
0006 
2582 
9021 
4503 


6982 
9653 
1070 
7035 
5878 


2774 
0478 
1717 
7326 
5756 


9360 
7569 
9729 
2201 
1549 


7484 
8940 
2563 
9418 
6469 


3106 
4534 
2946 
8312 
1933 


6515 
3097 
1231 
7193 
4786 


0280 
3670 
4821 
2055 
8899 


9135 
5728 
2849 
7513 
3001 


3954 
2438 
7632 
2314 
8026 


6580 
3686 
5207 
0319 
6105 


3845 
3654 
5693 
8182 
8989 


Table 7 


RANDOM DIGITS (Continued) 


6033 
4157 
1533 
8674 
3036 


6639 
7549 
7092 
9848 
9902 


1595 
9029 
0515 
4600 
4793 


4877 
5367 
2863 
8068 
3186 


6299 
8894 
0651 
5506 
6847 


9371 
$797 
2373 
4872 
6721 


1669 
4933 
3163 
2774 
7947 


7422 
L757 
7469 
7431 
0431 


6983 
8478 
4627 
7908 
Jo25 


6865 
2863 
3012 
6270 
6789 


9813 
5626 
6572 
9233 
9047 


0990 
2539 
4891 
6241 
6935 


5693 
8306 
0560 
0640 
5957 


6936 
937 
5684 
7339 
8482 


2229 
0446 
9109 
2993 
4543 


3278 
5030 
1157 
2702 
0086 


2007 
0111 
6372 
6030 
1659 


2688 
4204 
8973 
2389 
8012 


0181 
9187 
0456 
1241 
1068 


9029 
5565 
1218 
6461 
4359 


1052 
1603 
8408 
1799 
8719 


0037 
2315 
9239 
1084 
3681 


3008 
8892 
9021 
9668 
4123 


4109 
2701 
3517 
3395 
6680 


2219 
3494 
7448 
7028 
7448 


3503 
6524 
4208 
1807 
2623 


7784 
6703 
2600 
8260 
6571 


8649 
1650 
3332 
4094 
3214 


3327 
2a 
4245 
O97] 
6272 


8700 
8923 
7309 
2079 
1820 


1816 
1339 
2173 
5281 
8498 


7309 
8030 
0738 
8142 
6420 


9816 
4127 
0632 
6379 
6555 


8060 
2587 
7448 
9359 
2656 


9145 
8211 
2228 
4830 
2017 


4496 
2265 
3623 
9056 
7977 


6363 
1234 
9887 
9023 
1969 


0156 
2486 
0294 
5062 
8927 


6812 
9032 
3583 
7042 
7036 


0349 
5596 
4361 
2998 
5063 


7484 
4666 
4754 
0797 
1312 


4702 
7663 
1804 
8555 
0214 


7311 
1709 
4309 
6515 
3237 


1896 
2521 
2221 
3416 
1864 


7511 
1723 
9700 
3866 
4114 


8642 
7748 
9399 
8576 
7578 


6913 
2410 
7060 
1368 
TIS2Z 


1965 
0002 
5062 
0118 
1355 


1755 
9852 
8996 
6923 
0200 


3416 
8389 
3041 
5507 
9199 


1699 
1207 
0272 
0885 
7124 


0812 
3881 
3025 
1291 
8489 


6162 
4043 
4044 
6310 
6915 


6881 
2159 
8991 
6169 
4535 


2146 
6138 
0224 
8698 
8385 


5388 
7875 
7349 
4237 
4024 


6017 
1620 
3919 
9513 
7356 


5012 
4724 
0303 
0046 
0585 


3387 
1450 
1006 
2539 
6291 


8236 
9927 
4327 
9605 
7731 


7350 
2135 
1305 
0947 
4787 


4195 
8264 
1030 
5016 
S911 


1024 
6591 
7010 
7916 
6960 


7028 
6991 
7505 
5484 
2193 


4962 
3181 
4595 
0277 
3625 


1831 
6976 
6663 
8757 
1997 


6588 
4859 
LLi1 
6122 
1062 


2461 
7412 
7315 
6070 
7638 


4569 
7940 
5839 
2103 
2841 


1129 
9092 
8423 
6734 
6337 
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Table 10 


CRITICAL VALUES OF D* 


Sample 
size 





* Adapted from F. J. Massey, Jr., “The Kolgomorov-Smirnov test for goodness of fit,” J. Amer. 
Statist. Ass., Vol. 46 (1951), p. 70, with the kind permission of the author and publisher 
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01 


02 


Table 11 


VALUES OF Z =3 In 


.03 


.04 


OS 





07 


.08 


09 





r .00 
0.0 0.000 
0.1 0.100 
0.2 0.203 
0.3 0.310 
0.4 0.424 
0.5 0.549 
0.6 0.693 
0.7 0.867 
0.8 1.099 
0.9 1.472 


0.010 
0.110 
0.213 
0.321 
0.436 
0.563 
0.709 
0.887 
L127 
Loze 


0.020 
0.121 
0.224 
D332 
0.448 
0.576 
0.725 
0.908 
1.157 
1.589 


0.030 
0.131 
0.234 
0.343 
0.460 
0.590 
0.741 
0.929 
1.188 
1.658 


0.040 
0.141 
0.245 
0.354 
0.472 
0.604 
0.758 
0.950 
L.2ek 
1.738 


0.050 
0.151 
0.255 
0.365 
0.485 
0.618 
0.775 
0.973 
1.256 
1.832 


0.060 
0.161 
0.266 
0.377 
0.497 
0.633 
0.793 
0.996 
1.293 
1.946 


0.070 
0.172 
0.277 
0.388 
0.510 
0.648 
0.811 
1.020 
1.333 
2.092 


0.080 
0.182 
0.288 
0.400 
0.523 
0.662 
0.829 
1.045 
1.376 
2.298 


0.090 
U.192 
0.299 
0.412 
0.536 
0.678 
0.848 
1.071 
1.422 
2.647 


For negative values of r put a minus sign in front of the corresponding Z’s, and vice versa. 
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Table 12 (a) 


VALUES OF ry FOR a= 0.05* 





*This table is reproduced from H. L. Harter, ‘“‘Critical values for Duncan’s new multiple range 
test.’ It contains some corrected values to replace those given by D. B. Duncan in ‘‘Multiple 
Range and Multiple F Tests,’ Biometrics, Vol. 11 (1955). The above table is reproduced with 
the permission of the author and the editor of Biometrics. 
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Table 12 (b) 


VALUES OF 7, FOR a= 0.01* 





*This table is reproduced from H. L. Harter, “Critical values for Duncan’s new multiple range 
test.’ It contains some corrected values to replace those given by D. B. Duncan in ‘‘Multiple 
Range and Multiple F Tests,” Biometrics, Vol. 11 (1955). The above table is reproduced with 
the permission of the author and the editor of Biometrics. 
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Table 13 


CONTROL CHART CONSTANTS* 


Chart for standard deviations 


Factor 
for 
central 




















Chart for ranges 


Chart for averages 


Factors for 
control limits 






Number of 
observations 
in 
sample, n 







Factor 
for 
central 









Factors for 
control limits 


Factors for 
control limits 






























































































2 ().5642 
3 0.7236 Pa 
4 0.7979 2.202 
5 0.8407 ZALS 
6 ().8686 | 0.026 0.030 0) 2.004 
7 0.8882 | 0.105 0.118 0.076 | 1.924 
8 0.9027 | 0.167 0.185 0.136 | 1.864 
9 US139 | O.219 ().239 0.184 | 1.816 
10 (1.308 | 0.9227 | 0.262 ().284 223 | led T7 
Li (1.285 | 0.9300 | 0.299 ().321 ().256 | 1.744 
12 0.266 | 0.93359 | 0.331 ().354 ().284 | 1.716 
13 0.249 | 0.9410 | 0.359 0.382 0.308 | 1.692 
14 0.235 | 0.9453 | 0.384 0.406 0.329 | LST 
LS 0.223 | 0.9490 | 0.406 0.428 0.348 | 1.652 


*This table is reproduced, by permission, from the ASTM Manual on Quality Control of Materials, 
American Society for Testing and Materials, Philadelphia, Pa., 1951. 
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Table 14 


160.193 188.491 242.300 
18.930 22.401 29.055 
9.398 11.150 14.527 
6.612 7.855 10.260 
5.337 6.345 8.301 
4.613 5.488 7.187 
4.147 4.936 6.468 
3.822 4.550 5.966 
3.582 4.265 5.594 
3.997 4.045 5.308 
3.250 3.870 5.079 
3.130 S.12t 4.893 
3.029 3.608 4.737 
2.945 3.507 4.605 
2.872 3.421 4.492 
2.808 3.345 4.393 
2193 3.219 4.307 
2.703 ee dk 4.230 
2.659 3.168 4.161 
2.494 2.972 3.904 
2.385 2.841 3.733 
2.306 2.748 3.611 
2.247 2.677 3.518 
2.200 2.621 3.444 
2.162 2.576 3.385 
2.130 2.538 3.335 
2.103 2.506 3.293 
2.080 2.478 S257 
2.060 2.454 R225 
2.042 2.433 3.197 
2.026 2.414 3.173 
2.012 2.397 3.150 
1.999 2.382 3.130 
1.987 2.368 3.112 
LOTT pa 3.096 
1.905 2.270 2.983 
1.865 2222 2.921 
1.839 2.191 2.880 
1.820 2.169 2.850 
1.794 2.138 2.809 
LaT? 2a. 07 2.183 
1.764 2.102 2.763 
1.755 2.091 2.748 
1.747 2.082 2.736 
1.741 2.075 2.726 
1.736 2.068 2.718 
1.645 1.960 2026 





*Adapted by permission from Techniques of Statistical Analysis by C. Eisenhart, M. W. Hastay, 
and W. A. Wallis. Copyright 1947, McGraw-Hill Book Company, Inc. 
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Table 15: 


SAMPLE SIZE CODE LETTERS FROM MIL-STD-105 D 


General inspection levels 















Lot or batch size 





YOR wr> 










151 


a or Go 





501 to 1,200 
1,201 to 3,200 
3,201 to 10,000 

10,001 to 35,000 








35,001 to 150,000 
150,001 to 500,000 
500,001 and over 


Z2r ASR OUD 





Ow2 stan 








Answers to 
Odd-Numbered 
Exercises 





CHAPTER 2 
PAGE 13 
7. 6,561. 
9. (b) Bis the event that three graduate assistants are present, C is the 


event that as many graduate assistants as professors are present, 
D is the event that three persons (professors or graduate assis- 
tants) are present; 
(c) (1,1), C, 2), (2, 1), (2, 2); at most two graduate assistants are 
present; 
(d) Yes. 
11. (a) 2; (b) 7; (c) 1 and 4; (d) 1, 2, 3, and 5; (e) 8. 
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509 Answers to Odd-Numbered Exercises 


PAGE 27 


1. 


@(0), IA D’(20), TA D(10), 1’ A D(5), I’ A D'(465), 1(30), D(15), 
1'(470), D'(485), IAD’) U (I 0 DY25), IA D) U (I A D')(475), 
IU D(35), I’ U D'(490), D U 1'(480), TU D’(495), S (500). 


3. Claim should be questioned, since M(RM P’) + N(R P) + 
N(R’ OP)+ N(R VU P)'] = 430. 
5. (a) Yes; (b) No, sum exceeds 1; (c) No, P(C) < 0; (d) No, sum is less 
than 1; (e) Yes. 
7. (a) 0.50; (b) 0.40; (c) 0.50; (d) 0.22; (e) 0.25; (f) 0.20. 
Hi, 0.75 = P = 0,80, 
13. Inconsistent, since P(A U B) = 1/3 + 1/4 > 1/2. 
15. (a) 1/26; (b) 1/2; (c) 4/13; (d) 1/13. 
17. (a) 84/150; (b) 99/150; (c) 43/150; (d) 63/150. 
19. (a) 0.94; (b) 0.21; (c) 0.86; (d) 0.59. 
21. (a) 0.29; (b) 0.18. 
23. 0.0024. 
PAGE 40 
1. P(J| D) = 2/3, PU| D’) = 4/97. 
3. (a) 62/85; (b) 74/84; (c) 29/51. 
5. (a) 13/30; (b) 1/2; (c) 7/16. 
9. (a) 1/4; (b) 25/102. 
11. P(E, OP, OC) =007, PE, Pe)P(C,) —0072, dependent 
events. 
13. (a) 1/20; (b) 0.0768; (c) 0.025. 
15. (a) 0.885; (b) 0.115. 
17. 6/17. 
19. (a) 0.781; (b) 0.219. 
21. Industrial sabotage, P(sabotage| explosion) = 0.344. 
PAGE 47 
1. $0.14; $0.22. 
3. (a) Yes; (b) equally profitable; (c) more profitable to test. 
5. $1.20. 
7. (a) P(Winning) < 1/3; (b) P(Winning) > 1/3. 
9. (a) First job, since E, = $165,000 and E, = $135,000; 
(b) Second job so that he will at least have a chance. 
11. np=m(1 — p);p = 3/4. 
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13. 


(a) FF costs $6.25, LWR costs $10.53/KW/year of useful life; 

(b) FF costs more than 1.68 times LWR fuel; 

(c) LWR now costs $11.40/KW/year, FF fuel cost must now exceed 
1.82 times LWR fuel cost. 


CHAPTER 3 


PAGE 63 


1. 


3s 
9. 


11. 


13. 
15. 
17. 
19. 
21. 


f (0) = 1/20, f(1) = 2/20, f(2) = 3/20, f(3) = 4/20, f(4) = 4/20, 
7S) = 3/20, 76) = 2720, 7) = 1720. 

(a) Yes; (b) Yes; (c) No, f(3) < 0; (d) Yes; (e) No, >) f(x) < 1. 
(a) 0.3169; (b) 0.1442; (c) 0.0861; (d) 0.0746; (e) 0.4862; (f) 0.7052. 


(a) ( : ).2)°(8)' — 0.1468: (b) 0.1468. 


(a) 0.3412; (b) 0.9804; (c) 0.1184. 

(a) 0.5367; (b) 0.2059. 

(a) 102/253; (b) 459/1012; (c) 145/1012. 
(a) 11/21; (b) 44/105; (c) 2/35. 

(a) 0.117; (b) 0.116, error = 0.85%. 


PAGE 76 


SS 


(a) 1.0; (b) 1.0. 

w=A4n+ 1), a? = z(v’ — 1). 

= 32." = Tas. 

Chebyshev Binomial 

1.0000 0.4544 
0.2500 0.0768 
0.1111 0.0042 

5,625. 


m= WN => 


PAGE 86 


(a) 0.173; (b) 0.091; (c) 0.809. 
= 3, FP = U2. 
(a) 0.191; (b) 0.874; (c) 0.473. 
0.007. 
(a) 0.981; (b) 0.577; (c) 0.978. 
0.024. 
(a) 1.5; (b) 0.9; (c) 3 min. 
Savings are $68.56; not worthwhile. 
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PAGE 90 


1. 0.058. j 


. fda) 
is) 


(i), (b) 0.195. 


n 


PAGE 94 


1. (a) 00-40, 41-77, 78-93, 94-98, 99. 
3. (a) 0000-2465, 2466-5917, 5918-8334, 8335-9462, 
9463-9857, 9858-9968, 9969-9994, 9995-9999. 


CHAPTER 4 


PAGE 102 


3. (a) 0.30; (b) 0.50; 

(c) F(x) =0 (x <0), FX) = 4x7 0< x < 1), 
F(x) = 2x — 4x? —1(1<x< 2), Fa) =1@ 22); 

(d) 0.02; (e) 0.84. 

5.~ (a) 5/9; (b) 9/100; (c) f(x) = 0 (x < 2), f(x) = 8x73 (x > 2). 

7. (a) 0.393; (b) 0.148; (c) 0.223. 

9, p= 3/3, 07 = 19/320, 

11. yw = 4, o* does not exist. 


PAGE 110 


1. (a) 0.9332; (b) 0.1151; (c) 0.0154; (d) 0.9599; 
(e) 0.4987; (f) 0.1018; (g) 0.2789; (h) 0.9093. 

3. (a) 0.6826; (b) 0.9544; (c) 0.9974. 
5, (a) 0.3156; (b) 0.1492; (c) 0.5762; (d) 0.0606. 
7. 0.1292. 
9, 0.8093. 

11. (a) 0.0062; (b) 0.1587; (c) 0.8664. 

13. 83.14%. 

15. w= 2.984. 
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PAGE 123 


1. 


Ve 
9. 
11. 
13. 
1, 
19. 





—¢ 
(a) u= 4,0 =./8 ; (b) 0.594. 
n = 25, maximum profit = $1235. 
(a) e-™; (b) e™; (c) f@) =0(¢< 0), f@) = ae (> 0). 
(a) 0.362; (b) 0.067. 
0.264. 
(a) 750 minutes; (b) 0.73786 


F(x) = 0 (% <a), FQ) =F @ <x <A), FX) =1 (2B). 


PAGE 130 


1. 


eee 


(a) 1/4; (b) 1/8; 
(c) F(X1, x.) = 0 (x, <0, x, < 0), 
F (x1, X2) = 4xjx3 (0 < x, < 1,0 < x, < 2), 
F(x;,%.) =1(%, 21,x, > 2), 
F (X44 Xs) = X*, (0 =< Xy =< 1, xX > 2); 
F(X1, X2) = 4x3 (x, > 1,0 < x, < 2). 
(a) f(x, |0.25) =x, +40 <x, < 1); 
_ xX, + 2x, 
(b) f(%,|%1) = es 
(a) 0.3264; (b) 0.472. 
2 
= LW, o* = 7(a*W? + BL? 4+ za7b?). 


CHAPTER 5 


PAGE 146 


15. 


No. The volume, not the height and width, should be doubled. 


PAGE 156 


3. 


ce 
9. 
11. 
13. 
15. 


(a) x = 103.7, claim seems reasonable but is hardly proved; 
(bo). o= 11.1, 67%. 

(a) ¥ = 459.2; (b) s? = 953.79; (c) C.V. = 6.73%. 

(a) x = 12.92, s = 8.92; (b) impossible, open class. 

Y= 2,3, 8 = 3,16, 

(a) 11.4; (b) 27.45. 

(a) 73; (b) 8.02%. 
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CHAPTER 6 


PAGE 170 


7. 


13. 
IS, 


(a) Becomes 1/3 as large; (b) becomes 2/3 as large; 

(c) becomes 5 times as large. 

0.6284. 

A process which is “in control” will be judged “out of control” with a 
probability less than 0.0013. 


PAGE 178 


os Ee ale 


t = 3.213 t.o95 = 2.797 (24 d.f.); information does not support claim. 
t = 4.743 t.o95; = 3.250 (9 d.f.); process is out of control. 

0.94. 

0.02. 


CHAPTER 7 


PAGE 191 


E = 1.48 beats per minute. 

(a) E = 0.84; (b) 11.97 < pw < 13.49. 
E = 0.33. 

= 171: 

E = 1.16 minutes. 

E= 5.2, 

135,639 < yw < 157,745. 

(a) 835; (b) 727; (c) 901. 


PAGE 207 


1. 
7 


9. 
11. 
13. 


(a) «= O.11; (b) B = 0.11. 

(b) 8 = 0.800 for uw = 0.5, B = 0.255 for uw = 1.0, B = 0.015 for 
p= 1.5. 

(a) 41; (b) 40. 

(a) 0.91; (b) 0.78; (c) 0.55; (d) 0.31; (e) 0.13; (f) 0.04. 

(a) 0.60; (b) 0.31; (c) 0.12; (d) 0.03; (e) 0.00. 
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PAGE 221 


1. 


(a) Ly — Ld, <0, buy radial tires only if H, can be rejected; 
(b) “, — &, > O, buy radial tires unless H, can be rejected. 
z= 2.07, teject Hy. 
t = 1.48, cannot reject Ho. 
(a) z = 1.15, cannot reject Hy; (b) 0.04. 
t = 3.09, reject the claim. 
(a) uw = 0.006, o = 0.0036; (b) 0:0475. 
n= 2), 
t = 1.83, cannot reject Hp. 
t = 4.16, reject H, (substantiates claim). 
|t]| = 2.21, cannot reject Ho. 
= “ [(n, — lst + (2 — I)sz] (m1 + nz) 
(xX, — X,) + ee 
where t,,. has nm, + n, — 2 df. 


CHAPTER 8 


PAGE 231 


pom 


a lee a 


(a) s = 4.2; (b) 4.3. 
(a) 3.29; (b) 3.35. 
3.56 < a* < 72.80. 
6.56 <a < 10.59. 
Do = O <— 5.17. 
Lal <@ = 20), 


PAGE 238 


el dae sal 


13. 


vy? = 5.83, cannot reject Ap. 

g=3.51, reject H,. 

vy? = 19.21, cannot reject Ap. 

z = —2.10, reject H, (inspector not making satisfactory measure- 
ments). 

F = 1.20, cannot reject H, (t-test justified). 

F = 1.51, cannot reject H, (claim not substantiated). 

(a) No, variances are unequal; (b) base test on logarithms of the 
observations. 
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CHAPTER 9 


PAGE 249 


Ts 
9. 
11. 
13. 
15. 


0.35 < p < 0.49 

(a) 0.52 < p < 0.64; 

(b) 0.521 < p < 0.633. 

(a) 0.49 <p < 0.63; 

(b) 0.491 < p < 0.629. 

E = 0.116. 

(a) n = 1068; (b) nm = 1025. 

0.520 < p < 0.631. 

p < 0.0232. 

(a) 0.40, 0.56, 0.04; (b) 0.598, 0.133, 0.269. 
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z = 2.19, reject Hy (p = 0.30) in favor of alternative p > 0.30. 
z = —1.83, cannot reject Hy. 

B(4; 13, 0.60) = 0.0321, reject Ap. 

(b) 0.0345 

z = —0.64, difference not significant. 

z = —2.91, reject Hy; yes. 

(a) x? = 34.9, significant difference; (b) z = —5.91. 

e = 1.76, Teject:.A,. 

x* = 7.10, differences not significant. 

x? = 9.39; yes, proportions not the same. 


PAGE 265 


Sa ale le 


2 


.46, cannot reject Ho. 

0.20, cannot reject Ho. 

= 22.6, reject the claim. 

es 

x? = 9.16, good fit. 

(a) 0.0313, 0.1071, 0.2223, 0.2811, 0.2163, 0.1013, 0.0289; 

(tO) 2.5, 810,.17.8, 22.5, 17 dy Bely 2.0! 

(c) x* = 1.454, good fit; since x, s, and }) f; were needed from the 
data, d.f. = k — 3. 


2 


2 


eR RR 
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CHAPTER 10 


PAGE 279 


1. 
3 
9. 
11. 
13. 
15. 
17. 


z= 3,91, reject A): 

P (9 or more successes in 15) = 0.3036, cannot reject Ho. 
z = —0.10, cannot reject Hp. 

z = 2.92, difference is significant. 

z = 0.10, cannot reject Ho. 

H = 1.53, cannot reject Ap. 

H = 26.1, populations not identical. 


PAGE 287 


Sig ad ade Bh 


z = 0.24, cannot reject Ho. 

z = 2.43, reject H, (signal could contain a message). 
z = —0.11, cannot reject hypothesis of randomness. 
z = —4.00, reject Ho. 


CHAPTER 11 


PAGE 302 


Se ae 


jomk 


(a) a = 39.05, b = 0.764; (b) y’ = 65.8. 

11.86 < B < 17.12. 

y’ = 3.34 + 24.93x; 22.32 < B < 27.54. 

53.01 <a + Bx < 61.99. 

(a) b = ¥ xy/X x?; (b) b = 14.75; (c) b = 25.48. 
p= —2,24 + 0.100x. 


PAGE 317 


> 


y’ = 2.00(0.988)*. 

y’ = 28,3¢9-000153%, 

(a) ? = 1.50; (b) 1.39 <<} < 1.61, assume log p, are independent 
and normally distributed with equal variances. 

0.928. 
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9. y’ = 1.751 — 0.316x + 0.025x?. 
11. (a) y’ = 10.48 — 0.383x, t = —2.27, not significant, 67 = 1.70; 
(b) 63 = 0.27, F = 5.3, not significant. 
15. y’ = 961x, + 2,976x, — 16,279; $34,061. 


PAGE 328 

1. (a) r= 0.81; (b) z = 5.16, reject Ap. 

3. (a) r = —0.58; (b) —0.85 < p < —0.07. 

9. z= 0.627, cannot reject H. 
11, (a) 0.47 < p < 0.95; (b) —0.33 < p < 0.52; (c) —0.75 < p < 0.52. 
13. (a) r’ = 0.70; (b) r’ = 0.83; (c) r’ = 0.92. 

CHAPTER 12 

PAGE 342 


5. F = 0.68, not significant. 
7. F = 2.80, not significant. 
9, F= 11.3, significant. 


PAGE 354 

1. For threads, F = 8.3, significant; for instruments, F = 0.06, not 
significant. 

3. For students, F = 20.6, significant; for forms, F = 0.51, not sig- 
nificant. 


5. For flow, F = 7.8, significant; for precipitators, F = 8.1, significant. 
13. A CG B 
72 76 86 
bp. 2 4 3 5 1 
25.65 23.70 23,508 20.90 20.68 


PAGE 364 


1. For treatments, F = 6.55; significant; 
for rows, F = 1.29; not significant; 
for columns, F = 26.83; significant; 
for replicates, F = 0.02; not significant. 
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3. For golf-ball designs, F = 66.6; significant at 0.01 level; 
for pros, F = 24.3; significant at 0.01 level; 
for drivers, F = 2.73, not significant; 
for fairways, F = 73.3; significant at 0.01 level. 
5. For needles, F = 92.5; significant; 
for threads, F = 13.0; significant; 
for operators, F = 1.3; not significant; 
for machines, F = 0.2; not significant; 
for weeks, F = 0.9; not significant. 


PAGE 372 


1. F = 10.3; significant; 6 = —57.0. 
3. For analysis of covariance, F = 14.2, significant at 0.05 level; 
for one-way analysis of variance, F = 15.0, also significant. 


CHAPTER 13 


PAGE 389 


1. Significant F—values for temperature (F = 11.5, « = 0.01), concen- 
tration (F = 48.0, « = 0.01), and their interaction (F = 3.8, 4 = 
0.05). Optimum conditions are 10 grams at 75°F, with 37.32 < yu 
< 46.02. 

3. For detergents, F = 0.05; not significant; 
for machines, F = 6.9; significant at 0.05 level; 
for interaction, F = 0.81; not significant. 

5. Hours, dosages, and the strain-dosage interaction are significant 
at the 0.01 level with corresponding F values of 39.4, 22.9, and 7.3; 
the hours-dosage interaction as well as replications are significant 
at the 0.05 level with corresponding F values of 3.8 and 5.1. 

7. The F values for C and D are 12.7 and 86.5, respectively; both are 
significant at the 0.01 level. 


PAGE 401 


1. (a) SST = 110.44, SS(Tr) = 103.94, SS(Rep) = 1.56, SSE = 4.94; 
(6) (4) = 1, (l= —13, (48) = 3, (Cl = —95, fac] = 4, 
[BC] = —29, [ABC] = 3; 
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(c) SSA = 0.06, SSB = 10.56, SS(AB) = 0.56, SSC = 39.06, 
SS(AC) = 0.56, SS(BC) = 52.56, SS(ABC) = 0.56; 

(e) B,C, and BC are significant at the 0.01 level, corresponding F 
values are 14.9, 55.0, and 74.0. 

A and E are significant at the 0.01 level with corresponding F values 

of 13.4 and 18.2; B, C, AD, ACD, BCD, and BDE are significant at 

the 0.05 level with corresponding F values of 6.0, 6.3, 6.6, 5.0, 4.4, 

and 4.4. 

C, B, BE, BC, BCE, A, AC, AB, ABD, and ABC are significant at 

the 0.01 level with corresponding F values of 408.0, 1051.9, 18.2, 

18.2, 15.9, 2891.3, 11.7, 149.7, 15.9, and 46.6; E and ACE are sig- 

nificant at the 0.05 level with corresponding F values of 6.6 and 6.6. 


mn ——~ 
&, = 64.8, (4B) oo = — 35.1, ip = 84.0, and (@BY)oo0 == — 35.0. 


PAGE 415 


1. 


11. 


13. 


15. 


(a) Block I: 1, b, ac, abc, ad, abd, cd, bcd, ae, abe, ce, bce, de, bde, 
acde, abcde; 
Block 2: a, ab, c bc, d, bd, acd, abcd, e, be, ace, abce, ade, abde, 
cde, bcde; 

(b) Block 1: 1, ac, abd, bcd, be, abce, ade, cde; 

Block 2: a,c, bd, abcd, abe, bce, de, acde; 
Block 3: 6, abc, ad, cd, e, ace, abde, bcde; 
Block 4: ab, bc, d, acd, ae, ce, bde, abcde. 

32 blocks of size 2; confound on AB, AC, DE, DF, and AD. 

The intrablock SSE equals 4.39; B, C, and BC are significant at the 

0.01 level with corresponding F values of 14.5, 53.5, and 72.0 

(a) Aand BCD, Band ACD, Cand ABD, Dand ABC, ABand CD, 
AC and BD, AD and BC. 

(b) 1 and BC and ABD and ACD, A and ABC and BD and CD, 
Band C and AD and ABCD, AB and AC and D and BCD. 

1, be, adf, aeg, defg, abcdf, abceg, bcdefg, bdg, bef; cef, abfg, acfg, 

abde, acde, cdg, ab, ac, bdf, beg, cdf, ceg, acdefg, abdefg, de, fg, adg, 

aef, bcde, bcfg, abcdg, abcef. 

(a) 4920, —360, —420, —144, —84, —68, 12, —40, —374, 118, 
—126, 122, 22, 2, —22, —22, —84, 16, —32, 64, —80, 44, —24, 
120, —138, 70, —90, 42, —54, 18, —2, 82; 

(b) [7], [A], [B], [48], [C], [AC], [BC], error, [D], [AD], [BD], error, 
[CD], error, error, [EF], [EZ], [AZ], [BE], error, [CE], error, error, 
[DF], [DE], error, error, [CF], error, [BF], [AF], [F]; 
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(c) MSE = 123.41; A, B, and D are significant at the 0.01 level with 
F values of 32.8, 44.7, and 35.4, respectively; AB is significant at 
the 0.05 level with an F value of 5.2. 


CHAPTER 14 


PAGE 430 


1. (a) central line = 0.150, UCL = 0.153, LCL = 0.147; 
(b) central line = 0.005, UCL = 0.010, LCL = 0; 
(c) x: 8th, 16th, and 17th sample values outside limits; 
R: All sample values within limits. 
3. (a) central line = 48.1, UCL = 50.3, LCL = 46.0; 
(b) central line = 2.9, UCL = 6.7, LCL = 0; 
(c) process mean out of control, process variability in control; 
(d) z = —2.12, there is a trend; 
(e) no, process is not in control. 
5. (a) x? central line == 21.7, UCL. = 25,2, LCL = 18.2: 
ao: central ine = [.05, UCL = 3.74, LCL = 0: 
(b) yes, process is in control. 
central line = 0.0017, UCL = 0.0042, LCL = 0. 
(a) central line = 0.037, UCL = 0.093, LCL = 0. 
11. Yes, central line for c-chart is 4.9, UCL = 11.6, and LCL = 0. 


le ie 


PAGE 435 


1. Wecan assert with 95% confidence that 99% of the pieces will have 
yield strength between 36,897 and 68,703 psi. 
3. (a) 0.1063 + 0.0008; (b) 0.1063 + 0.0001. 


PAGE 446 

1. (a) 0.046, 0.139; (b) 0.059, 0.143. 

3. Producer’s risk = 0.143, consumer’s risk = 0.082. 

7. AOQOL + 0.025. 

9. & = 0.15. 
11. Sample size is 32, acceptance number is 5, rejection number is 6. 
13. a, = —1.67 + 0.19n, r, = 2.14 + 0.19; reject on 8th trial. 
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CHAPTER 15 
PAGE 458 
1. 0.9936. 
3. 0.9983. 
5. (a) f® = Bil —t/a)e"#"- for 0 << t <a, 


f(t) = 0 elsewhere, 
F(t) = 1 — eFC -"4*" for 0 =< f <= a, 
F(t) = 1 — e-*4? fort > a. 


7. (a) 0.6703; (b) 0.6703. 
o.. Si, 
11. (a) 0.6065; (b) 0.9502; (c) 0.3679. 
13. (b) F(x) = 1— (1 — p)* for x = 1, 2, 3,...; (c) 0.301. 
PAGE 469 
1. (a) 2,635 < uw < 11,002; (b) 7, = 39,810, reject Ho. 
3. (a) 65.0 < uw < 368.9; (b) 7, < 2511.25, cannot reject Ap. 
5. (a) c* = 998.7; (b) ¢* = 24.1. 
7. & = 0.0039, B = 0.536; R(1000) = 0.854; using exponential model, 
& = 0.126 per thousand hours. 
9, &@=6.75 x 10-5, ® = 1.51, @ = 580F (1.66) = 523. 


in al > 





A 


Accelerated life testing, 462 
Acceptable quality level, AQL, 437 
Acceptance sampling, 436 

double sampling, 440 

multiple sampling, 440 

sequential sampling, 444 

single sampling, 437 
Addition formula, 17 
Additive set function, 15 
Aliasing, 413 


523 


Analysis of covariance, 368 
table, 372 
Analysis of variance table, 340, 341, 
349, 360, 382, 387, 400 
dummy, 408 
interblock, 411 
intrablock, 410 
Arithmetic probability paper, 143 
Average, 67, 102 
Average outgoing quality, AOQ, 438 
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Average sample number, ASN, 
curve, 442 
Axioms of probability, 18, 78, 98 


Balanced design, 357 
Balanced incomplete block design, 
design, 364 
Bar chart, 54 
Bayes rule, 39 
Bayesian estimation, 188, 246 
Bayesian inference, 40 
Bernoulli trials, 55 
Beta distribution, 120 
Binomial coefficient, 56 
Binomial distribution, 56 
normal approximation, 108 
Bivariate normal distribution, 322 
Block, 346 


C-chart, 430 

Categorical distribution, 137 
Causality, 327 

Central limit theorem, 169 
Chebyshev’s theorem, 73 
Chi-square distribution, 176 


Chi-square test, 234, 257, 262, 265 


Class: 

boundaries, 138 

interval, 138 

limits, 137 

mark, 138 
Coefficient of variation, 152 
Conditional distribution, 129, 190 
Conditional probability, 32, 34 


Confidence interval: 

for means, 186 

for MTBF, 463 

for proportions, 243, 246 

for regression coefficients, 298 

for standard deviations, 231 

for variances, 230 
Confounding, 406, 412 
Consumer’s risk, 437 
Contingency table, 261 
Continuity correction, 110 
Continuous random variable, 96 
Continuous sample space, 7 
Control charts: 

for fraction defective, 426 

for means, 422 

for number of defects, 429 

for number of defectives, 428 

for ranges, 424 

for standard deviations, 432 
Correction factor for finite 

populations, 167 

Correlation and causality, 327 
Correlation coefficient, 322 

multiple, 330 

rank, 330 
Critical region, 199 

for testing MU = Up, 212 

for testing MU, ~ Uy =6, 216, 219 

for testing 0? = fas Zo3 

for testing 0,7 = 07°, 236 

for testing p = pg, 253 

for testing py = p2, 255 

for testing B = By, 299 
Cumulative distribution, 54, 138 
Curvilinear regression, 306 


D 


Decision making, 46 
Decision theory, 180, 207 
Defining contrast, 413 
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Degrees of freedom: 

chi-square distribution, 176 

F-distribution, 177 

t-distribution, 173 
Difference between means, 215 
Difference between proportions, 255 
Discrete sample space, 7 
Discrete uniform distribution, 164 
Distribution, 52, 100 

beta, 120 

binomial, 56, 108 

bivariate normal, 322 

categorical, 137 

chi-square, 176 

conditional, 129, 190 

cumulative, 54, 138 

exponential, 119, 454, 462 

failure-time, 452, 465 

FATT 

gamma, 117 

geometric, 83 

hypergeometric, 61 

joint, 126 

log-normal, 114 

multinomial, 89 

normal, 105 

percentage, 139 

Poisson, 77 

posterior, 190, 247 

prior, 247 

sampling, 163 

symmetrical, 59 

t, 173 

uniform, 113 
Distribution-free methods, 272 
Double sampling, 440 
Duncan multiple range test, 352 


Effect total, 396 
Efficiencey, 271 
Equitable game, 48 


Errors, Type land Type II, 195 
Estimation: 

Bayes, 190 

efficient, 182 

point, 180 
Events, 9 

independent, 35 

mutually exclusive, 10 
Expected value, 45, 102 
Experiment, 5 
Experimental design, 331 
Exponential distribution, 119, 

454, 462 

Exponential regression, 307 
Extrapolation, 301 


ee 


F-distribution, 177 

F-test, 235, 337 

Factorial notation, 56 

Failure rate , 453, 465 

Finite population correction 
factor, 167 

Fisher Z transformation, 325 

Fractional replication, 412 

Frequency distribution, 136 

Frequency polygon, 141 


G 


Gamma distribution, 117 
Gauss-Markov theorem, 295 
General law of addition, 26 
General law of multiplication, 35 
Generalized interaction, 406 
Geometric distribution, 83 
Gompertz curve, 318 
Goodness-of-fit test, 264 
Graeco-Latin square, 362 

Grand mean, 336 
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H 


H test, 278 
Histogram, 53, 141 
Hypergeometric distribution, 61 


Incomplete block design, 364, 406 
Independent events, 35, 36 
Independent random variables, 323 
Interaction, 377, 381, 385 
genralized, 406 

Interblock error, 411 

Interval estimate, 185 

Intrablock error, 409 


Joint probability density, 126 


K 


Kolmogorov-Smirnov test, 285 
Kruskal-Wallis H test, 278 
Kurtosis, 72 


Latin-square design, 357 
Law of large numbers, 75 
Least significant range, 352 


Least-squares criterion, 293 
Level (of a factor), 375 

Level of significance, 211 

Life testing, 461 

Limits of prediction, 300 
Linear regression, 290 
Log-normal distribution, 114 
Lot tolerance percent defective, 


LTPD, 437 


Mann-Whitney test, 275 
Marginal density, 127 
Mathematical expectation, 44, 69 
Mean: 
calculation of, 153 
of a probability density, 102 
of a probability distribution, 68 
of a sample, 149 
sampling distribution of, 173 
standardized, 168 
standard error of, 168 
weighted, 158, 190 
Mean time between failures, 
MTBF, 455 
Median, 150, 157 
runs above and below, 284 
Military Standard 105 D, 442 
Mode, 142 
Moments, 72, 101 
Monte Carlo methods, 91 
Multifactor experiments, 383 
Multinomial distribution, 89 
Multiple comparisons tests, 352 
Multiple correlation, 330 
Multiple regression, 313 
Multiple sampling plan, 440 
Mutually exclusive events, 10, 17 
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N 


Nonparametric methods, 272 
Nonreplacement test, 461 
Normal distribution, 105 
approximation to binomial, 108 
bivariate, 322 
circular, 131 
standard, 105 
Normal equations, 293, 309, 313 
Null hypothesis, 209 


O 


Odds, 19 

One-sample sign test, 272 

One-sample ¢ test, 213 

One-tail test, 199 

One-way classification, 335 

Operating characteristic (OC) curve, 
203, 437 


p-chart, 426 

Paired-sample sign test, 274 
Paired-sample ¢ test, 220 
Parallel system, 450 
Parameter, 56, 61, 163, 181, 394 
Percentage distribution, 139 
Percentiles, 152 

Personal probability, 19 
Pictogram, 149 

Pie chart, 148 

Point estimate, 180 

Poisson distribution, 77, 88 


Poisson process, 81 
Polynomial regression, 309 
Population, 160 
Posterior distribution, 190, 247 
Posterior probability, 247 
Prediction limits, 300 
Prior distribution, 247 
Prior probability, 247 
Probability: 
axioms, 18, 78, 98 
conditional, 32, 34 
Probability density, 99 
joint, 126 
marginal, 127 
Probability distribution function, 
52 (see also “‘distribution”’) 
Probability graph paper, 143 
Probability integral transformation, 
132 
Producer’s risk, 437 
Product law of reliabilities, 451, 456 
Proportions, 26 
confidence limits for, 243 
estimation of, 241 


Q 


Quartiles, 152 
Queue, 82, 84 


R-chart, 424 

Random digits, 92 

Random normal deviates, 134 

Random sample, 161, 162 

Random variable, 52 
standardized, 106 
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Randomization, 333 
Randomized block design, 346 
Randomized design, 335 
Range, 152, 228 
Rank correlation, 330 
Regression: 

coefficients, 295, 298 

curve, 290 

curvilinear, 306 

exponential, 307 

multiple, 313 

polynomial, 309 
Relative frequency, 18 
Reliability, 450 
Reliability function, 452 
Replacement test, 461 
Replication, 351 
Response surface analysis, 313 
Robustness, 270, 336 
Rule of elimination, 37 
Run tests, 283 


Sample, 161 
Sample size: 
for estimating means, 184 
for estimating proportions, 245 
Sample space, 5 
Sampling distribution, 163 
of the mean, 173 
of the variance, 176 
Sequential sampling, 444 
Series system, 450 
Set function, 15 
additive, 15 
Sign test, 272 
Significance test, 211 
Simulation, 91, 131 
Single sampling plan, 437 


Skewness, 60 

Smith-Satterthwaite test, 226 

Special law of addition, 27 

Special law of multiplication, 36 

Standard deviation, 70, 151 

Standard error of estimate, 297 

Standard error of the mean, 168 

Standard normal distribution, 105 

Standard order, 395 

Standardized mean, 168 

Standardized random variable, 106 

Statistic, 181 

Student-t distribution (see “‘t 
distribution’’) 

Subjective probability, 19 

Symmetrical distribution, 59 


t distribution, 173 

t test, 213,218, 220, 299 
Table of signs, 396 

Tolerance limits, 434 

Tree diagram, 8 

Two-factor experiments, 375 
Two-tail test, 199 

Two-way classification, 346 
Type I and Type II errors, 195 


U 


U test, 275 

Unbiased estimate, 181 

Uniform distribution, 113 
discrete, 164 
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V 


Variance: 

calculation of, 152 

of probability density, 102 

of probability distribution, 70 

of sample, 151 

sampling distribution of, 176, 228 
Venn diagram, 11 


W 


Waiting time, 119, 454 

Weak law of large numbers, 75 
Weibull distribution, 121, 455 
Weighted mean, 158, 190 


x 


x chart, 422 


Y 


Yates’ method, 399 


Z transformation, 325 











0-13-711945-3 


Mies 
—— -—# 






Pete inecian 


